siRNA Mediated Gene Silencing in Transgenic Animals

The present invention relates to a process that enables constitutive and
inducible gene knock down in living organisms using a short hairpin RNA
expression vector integrated into the genome of the organism.

Background of the Invention 

RNA interference (RNAi) was discovered some years ago as a tool for
inhibition of gene expression (Fire, A. et al., Nature 391, 806-811 (1998)). It
is based on the introduction of double stranded RNA (dsRNA) molecules into
cells, whereby one strand is complementary to the coding region of a target
gene. Through pairing of the specific mRNA with the introduced RNA molecule,
the mRNA is degraded by a cellular mechanism. Since long dsRNA provokes an
interferon response in mammalian cells, the technology was initially restricted
to organisms or cells showing no interferon response (Bass, B.L., Nature 411,
428-429 (2001)). The finding that short (<30 bp) interfering RNAs (siRNA)
circumvent the interferon response extended the application to mammalian
cells (Elbashir, S.M. et al., Nature 411, 494-498 (2001)).

Although RNAi in mice has been demonstrated in principle, the current
technology does not allow systematic gene function analysis in vivo. So far,
inhibition of gene expression has been achieved by injection of purified
siRNA into the tail vein of mice (McCaffrey, A.P. et al., Nature 418, 38-39
(2002); Lewis, D.H. et al., Nature Genet. 32, 107-108 (2002)). Using this
approach, gene inhibition is restricted to specific organs and persists only a
few days. A further improvement of the siRNA technology is based on the
intracellular transcription of short hairpin RNA (shRNA) molecules using gene
expression vectors (see Fig. 1; Brummelkamp, T.R. et al., Science 296, 550-553
(2002); Paddison, P.J. et al., Genes Dev. 16, 948-958 (2002); Yu, J.Y. et
al., Proc. Natl. Acad. Sci. USA 99, 6047-6052 (2002); Sui, G. et al., Proc. Natl.
Acad. Sci. USA 99, 5515-5520 (2002); Paul, C.P. et al., Nature Biotechnol. 20,
505-508 (2002); Xia, H. et al., Nat. Biotechnol. 10, 1006-10 (2002); Jacque,
J.M. et al., Nature 418(6896):435-8 (2002)). The activity of shRNA in mice
has been demonstrated by McCaffrey et al., 2002 through injection of shRNA
expression vectors into the tail vein. Again, gene inhibition was temporally and
spatially restricted. Although these results demonstrate that the mechanism of
shRNA mediated gene silencing is functional in mice, they do not clarify
whether constitutive RNAi can be achieved in transgenic animals.
Brummelkamp, T.R. et al., Science 296, 550-553 (2002); Paddison, P.J. et al.,
Genes Dev. 16, 948-958 (2002); Hemann, M.T. et al., Nat. Genet. 33(3):396-400
(2003); and Devroe, E. et al., BMC Biotechnol. 2(1):15 (2002) have
shown the long-term inhibition of gene expression through stable integration
of shRNA vectors in cultivated cell lines. These experiments included random
integration of shRNA transgenes and screening for clones with appropriate
siRNA expression, an approach that is not practicable for testing a large
number of different shRNA transgenes in mice. Finally, several reports have
demonstrated shRNA-mediated gene silencing in transgenic mice and rats
(Hasuwa, H. et al., FEBS Lett. 532(1-2):227-30 (2002); Carmell, M.A. et al.,
Nat. Struct. Biol. 10(2):91-2 (2003); Rubinson, D.A. et al., Nat. Genet.
33(3):401-6 (2003); Kunath, T. et al., Nat. Biotechnol. (Apr. 7 2003)).
However, these experiments again included random integration of shRNA
transgenes, resulting in variable levels and patterns of shRNA expression. Thus,
testing of ES cell clones or mouse lines for appropriate shRNA expression has
been required, which is a laborious and time-consuming undertaking.

The in vivo validation of genes by RNAi mediated gene repression in a
large scale setting requires the expression of siRNA at sufficiently high levels
and with a predictable pattern in multiple organs. Targeted transgenesis
provides the only approach to achieve reproducible expression of transgenes in
the living organism (e.g. mammals such as mice). It has been, however,
questionable whether a single copy of a siRNA expression vector integrated
into the genome would result in sufficiently high levels of siRNA required for
RNAi-mediated gene inhibition in multiple organs of the living organism.

Most siRNA expression vectors are based on polymerase III dependent
(Pol III) promoters (U6 or H1) that allow the production of transcripts carrying
only a few non-homologous bases at their 3' ends. It has been shown that the
presence of non-homologous RNA at the ends of the shRNA stretches lowers
the efficiency of RNAi mediated gene silencing (Xia et al., Nat. Biotechnol. 10,
1006-10 (2002)). However, it has been unpredictable for a person skilled in
the art which genomic region within the living organism promotes an
appropriate expression pattern of the Pol III promoter driven shRNA constructs
required for RNAi-mediated gene inhibition in multiple organs of the living
organism.

For the temporal control of RNAi mediated gene silencing in transgenic
cell lines and living organisms, a tight system for inducible siRNA expression is
needed. Inducible gene expression systems based on the tetracycline
dependent repressor are known. It is, however, not obvious whether a stably
integrated, single copy configuration of these systems can be created that
allows inducible RNAi in multiple organs without background activity.

Summary of the Invention 

It has now been found that the modification of murine embryonic stem (ES)
cells by the introduction of shRNA expression vectors through targeted
transgenesis allows the inhibition of gene expression in mice generated from
these modified ES cells. The invention thus provides

(1) the use of an expression vector comprising a short hairpin RNA (shRNA)
construct under control of a ubiquitous promoter for preparing an agent for
constitutive and/or inducible gene knock down (hereinafter also referred to as
"RNAi mediated inhibition of gene expression") in a living organism, in tissue
culture or in cells of a cell culture;

(2) a method for constitutive and/or inducible gene knock down in a living
organism, in tissue culture or in cells of a cell culture which comprises stably
integrating an expression vector as defined in (1) above into the genome of
the living organism, into the genome of the tissue culture or into the genome of
the cells of the cell culture;

(3) a living organism, tissue culture or cell culture having stably integrated,
preferably at a polymerase II dependent locus of the living organism, tissue
culture or cells of the cell culture, an expression vector as defined in (1)
above; and

(4) an expression vector as defined in (1) above.

The agent of embodiment (1) is suitable for in vivo and in vitro applications.

The living organisms in embodiments (1) to (3) above are preferably 
vertebrates. In embodiments (2) and (3) the living organisms are more 
preferably non-human vertebrates. The tissue cultures and the cells of the cell 
cultures of (1) to (3) above are preferably derived from vertebrates. The 
method of embodiment (2) includes in vivo and in vitro reaction schemes. 

Short Description of the Figures

Fig. 1: Mechanism of RNAi through shRNA expression. A: Expression of shRNA.
B: shRNA processing. C: Pairing of siRNA with complementary mRNA strands.
D: Cleavage of mRNA.

Fig. 2: Pol III dependent shRNA expression vector (constitutive). Insertion of
an shRNA expression vector into a ubiquitously expressed genomic locus.
Transcription through the upstream Pol II dependent promoter will be stopped
by a synthetic polyadenylation signal (pA) and the hGH pA. The Pol III
dependent promoter controls the expression of shRNA. The transcript is
stopped by five thymidine bases. The exact sequence of suitable target vectors
is given in SEQ ID NOs: 6 and 8.

Fig. 3: Tet repression system. tTS is a doxycycline controlled transcriptional
silencer consisting of the tet repressor and the KRAB-AB domain of the Kid-1
protein. This fusion protein binds to the Pol III dependent promoter via the tet
operator sequences only in the absence of doxycycline and represses
transcription.

Fig. 4: Vectors for Pol III dependent promoter based tet-repression system
(inducible). Insertion of an shRNA expression vector into a ubiquitously
expressed genomic locus. The transcription of the Pol II dependent promoter
will be stopped by the synthetic polyadenylation signal (pA) and a hGH pA. An
inducible Pol III dependent promoter controls the expression of shRNA. The
transcript is stopped by five thymidine bases.

Fig. 5: Principle of the doxycycline inducible gene expression system. rtTA2M2
is a doxycycline-controlled transcriptional activator consisting of the reverse
tet repressor and the VP16 activation domain of the Herpes simplex virus VP16
protein. This fusion protein binds to a minimal CMV-promoter via the tet
operator sequences in the presence of doxycycline and activates transcription.
tTS functions in a similar fashion as explained in Fig. 3.

Fig. 6: Vectors for Pol II dependent promoter based tet-system (inducible).
Insertion of an shRNA expression vector into a ubiquitously expressed genomic
locus. The rtTA2M2 is expressed under the control of the ubiquitous Pol II
dependent promoter, stopped by a synthetic polyadenylation signal (pA) and a
hGH pA. An inducible Pol II dependent promoter controls the expression of
shRNA. The transcript is stopped by a short polyadenylation signal.

Figure 7: ShRNA-mediated inhibition of luciferase gene expression in
embryonic stem cells. All cells carried stably integrated expression cassettes of
firefly- (Fluc) and Renilla luciferases (Rluc). Simultaneous expression of a
Fluc-specific shRNA under control of the H1- (H1-shRNA) or U6-promoter
(U6-shRNA) resulted in a ~IS% and ~60% reduction of firefly luciferase activity,
respectively. The values of Fluc activity were normalized by using the Rluc
activity for reference.

Fig. 8: Vectors for a U6 promoter based tet-repression system (inducible).
Transcription of the Pol II promoter will be stopped by the synthetic
polyadenylation signal (pA) and a hGH pA sequence. The inducible U6
promoter controls expression of the Fluc-specific shRNA. Transcription of the
shRNA is stopped by five thymidine bases. TetR is a doxycycline controlled
repressor (Gossen and Bujard, PNAS 89: 5547-5551 (1992)) expressed by
the CAGGS-promoter. The tetR binds to the Pol III promoter via the tet
operator sequences only in the absence of doxycycline and blocks
transcription.

Fig. 9: Vectors for H1 promoter based tet-repression system (inducible). The
inducible H1 promoter controls the expression of shRNA. TetR is a doxycycline
controlled repressor expressed by the CAGGS-promoter. The tetR binds to the
Pol III promoter via the tet operator sequences only in the absence of
doxycycline and blocks transcription.

WO 2004/035782    PCT/EP2003/011233

Figure 10: Inducible RNA interference in embryonic stem cells using
tetracycline dependent U6 promoters. All cells carried stably integrated
expression cassettes of firefly luciferase (Fluc), Renilla luciferase (Rluc), and
the tet repressor. Doxycycline-inducible expression of a Fluc-specific shRNA
under control of the U6tetO1- (Art4.12 RNAi 23/36 C2) or U6tetO2-promoter
(Art4.12 RNAi 24/36 A-D11) resulted in a ~30% and ~50% reduction of firefly
luciferase activity, respectively. The values of Fluc activity were normalized by
using the Rluc activity for reference (Figure 12).

Figure 11: Inducible RNA interference using tetracycline dependent H1
promoters in NIH3T3 cells. Cells were transiently transfected with firefly
luciferase (Fluc), tet repressor, LacZ, and shRNA expression vectors.
Doxycycline-inducible expression of the Fluc-specific shRNA under control of
the H1tetO2/5'-, H1tetO2/3'-, or H1tetO2/5'+3'-promoter resulted in a ~40%,
~80%, and ~40% reduction of firefly luciferase activity, respectively. The
values of Fluc activity were normalized by using the LacZ activity for reference.

Figure 12: Experimental strategy. Genes of the firefly and Renilla luciferases
are expressed through the endogenous rosa26 promoter via splice acceptor
sites. Specific shRNA mediated repression of the firefly luciferase gene can be
measured by using the Renilla luciferase activity as a reference.

Figure 13: ShRNA-mediated inhibition of luciferase expression in mice.
A: Firefly luciferase activity in mice in the absence (delta shFluc, black bars) or
presence of the U6- (U6-shFluc, white bars) and H1-shRNA transgenes
(H1-shFluc, grey bars), respectively. All mice carried the firefly and the Renilla
luciferase transgenes. Relative values of firefly luciferase activity in different
organs are given as indicated. All values of Fluc activity were normalized by
using the Rluc activity for reference.

B: Efficiency of shRNA-mediated repression of firefly luciferase expression in
mice. Percentages of U6- (white bars) and H1-shRNA mediated (grey bars)
repression of firefly luciferase activity are shown. ND: not determined.
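
The normalization referred to in Figures 7, 10, 11 and 13 (firefly activity divided by a reference reporter activity, then compared with a control lacking the shRNA) might be computed roughly as in the following sketch. The function names and all numeric readings are hypothetical illustrations and are not taken from the examples of this application.

```python
def normalized_activity(fluc: float, reference: float) -> float:
    """Return the firefly signal relative to the reference reporter (Rluc or LacZ)."""
    if reference <= 0:
        raise ValueError("reference reporter activity must be positive")
    return fluc / reference


def percent_reduction(control_ratio: float, shrna_ratio: float) -> float:
    """Return the knock-down as a percentage of the control ratio."""
    if control_ratio <= 0:
        raise ValueError("control ratio must be positive")
    return 100.0 * (1.0 - shrna_ratio / control_ratio)


# Hypothetical raw luminometer readings (arbitrary units), for illustration only.
control = normalized_activity(fluc=8000.0, reference=4000.0)
knockdown = normalized_activity(fluc=2400.0, reference=3000.0)
reduction = percent_reduction(control, knockdown)
print(f"reduction of Fluc activity: {reduction:.0f}%")
```

Dividing by the reference reporter first compensates for differences in cell number or lysis efficiency between samples, so that only the shRNA-specific change in firefly activity remains.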

Detailed Description of the Invention

The term "living organisms" according to the present invention relates to
multi-cell organisms which can be living vertebrates (hereinafter shortly
referred to as "vertebrates") such as mammals (e.g. non-human animals such
as rodents (including mice and rats, etc.); and humans) or non-mammals (e.g.
fish), or can be invertebrates such as insects or worms, or can be plants
(higher plants, algae or fungi). Most preferred living organisms are vertebrates,
in particular mice and fish.

"Tissue culture" according to the present invention refers to parts of the above 
defined living organisms (including organs and the like) which are cultured in 
vitro. 

"Cell culture" according to the present Invention includes cells isolated from 
the above defined living organisms which are cultured in vitro. These cells can 
be transformed (immortalized) or untransformed (directly derived from living 
organisms; primary cell culture). 

The expression vector according to the invention of the present application is
suitable for stable integration into the living organism or into cells of the cell
culture (vectors for transient integration are also contemplated). It is
moreover preferred that it contains homologous sequences suitable for
targeted integration at a defined locus, preferably at a polymerase II
dependent locus of the living organisms or cells of the cell culture. Such
polymerase II dependent loci include, but are not limited to, the Rosa26 locus
(the murine Rosa26 locus being depicted in SEQ ID NO:1), collagen, RNA
polymerase, actin and HPRT.

The expression vector may further contain functional sequences selected from
splice acceptor sequences (such as a splice acceptor of adenovirus, etc.),
polyadenylation sites (such as synthetic polyadenylation sites, the
polyadenylation site of human growth hormone, or the like), selectable
marker sequences (such as the neomycin phosphotransferase gene of E. coli
transposon, etc.), etc.

The ubiquitous promoter in the vector according to the invention is preferably
selected from polymerase I, II and III dependent promoters, preferably is a
polymerase II or III dependent promoter including, but not limited to, a CMV
promoter, a CAGGS promoter, a snRNA promoter such as U6, a RNAse P RNA
promoter such as H1, a tRNA promoter, a 7SL RNA promoter, a 5S rRNA
promoter, etc.

The ubiquitous promoter can be a constitutive promoter, or can be an 
inducible promoter. Suitable inducible promoters are the above-mentioned 
polymerase I, II and III dependent promoters containing an operator 
sequence including, but not limited to, tet, Gal4, lac, etc.

The expression vector of the invention is suitable for the following particularly
preferred approaches (for constitutive and inducible expression):

A. a Pol III dependent promoter (constitutive U6, H1 or the like) driven
shRNA construct (to be integrated into a ubiquitously active Pol II
dependent locus (see Fig. 2));

B. a Pol III dependent promoter (inducible U6, H1 or the like) driven
shRNA construct (to be integrated into a ubiquitously active Pol II
dependent locus (Fig. 3 and 4)); or

C. a polymerase II (Pol II) dependent promoter (inducible CMV or the like)
driven shRNA construct (to be integrated into a ubiquitously active Pol II
dependent locus (Fig. 5 and 6)).
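
The doxycycline dependence of the inducible approaches B and C, as described for Figs. 3, 5, 8 and 9, can be summarized by a small truth-table model. This is a deliberately simplified sketch: the labels "repression" and "activation" and all function names are hypothetical, binding is treated as all-or-nothing, and the additional tTS component mentioned for Fig. 5 is ignored.

```python
def repressor_bound(doxycycline: bool) -> bool:
    """tetR or tTS occupies the tet operator only in the absence of doxycycline."""
    return not doxycycline


def activator_bound(doxycycline: bool) -> bool:
    """rtTA2M2 occupies the tet operator only in the presence of doxycycline."""
    return doxycycline


def shrna_transcribed(system: str, doxycycline: bool) -> bool:
    """Return True if the shRNA is transcribed in this simplified model."""
    if system == "repression":  # approach B: inducible Pol III promoter (U6, H1)
        return not repressor_bound(doxycycline)
    if system == "activation":  # approach C: inducible Pol II promoter (minimal CMV)
        return activator_bound(doxycycline)
    raise ValueError(f"unknown system: {system!r}")


for system in ("repression", "activation"):
    for dox in (False, True):
        state = "on" if shrna_transcribed(system, dox) else "off"
        print(f"{system:<10} doxycycline={dox!s:<5} shRNA {state}")
```

In this model both designs switch shRNA expression on when doxycycline is added; they differ only in whether doxycycline releases a repressor or recruits an activator.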

The shRNA within the vector of the invention preferably comprises
(i) at least one DNA segment A-B-C wherein

A is a 15 to 35, preferably 19 to 29 bp DNA sequence being at least
95%, preferably 100% complementary to the gene to be knocked down (e.g.
firefly luciferase, p53, etc.);

B is a spacer DNA sequence having 5 to 9 bp forming the loop of the
expressed RNA hairpin molecule, and

C is a 15 to 35, preferably 19 to 29 bp DNA sequence being at least
85% complementary to the sequence A; and
(ii) a stop and/or polyadenylation sequence.
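
As an illustration of the A-B-C layout, the following sketch assembles a hairpin insert from a first arm A, a loop B and the reverse complement of A as arm C, and enforces the length limits stated above. The function names are hypothetical; the default loop TTCAAGAGA and the 19 bp p53 arm are taken from the p53 entry of Table 1, and the five-thymidine terminator follows the description of Fig. 2. It assumes a C arm that is 100% complementary to A, which is only one of the variants covered by the definition.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def build_shrna_insert(arm_a: str, loop: str = "TTCAAGAGA",
                       terminator: str = "TTTTT") -> str:
    """Assemble A-B-C plus a stop sequence, with C fully complementary to A."""
    arm_a, loop = arm_a.upper(), loop.upper()
    if set(arm_a + loop) - set("ACGT"):
        raise ValueError("sequences may contain only A, C, G and T")
    if not 15 <= len(arm_a) <= 35:
        raise ValueError("arm A must be 15 to 35 bp long")
    if not 5 <= len(loop) <= 9:
        raise ValueError("loop B must be 5 to 9 bp long")
    return arm_a + loop + reverse_complement(arm_a) + terminator


# p53 arm as listed in Table 1 (SEQ ID NO: 20).
insert = build_shrna_insert("GACTCCAGTGGTAATCTAC")
print(insert)
```

Because C is derived from A, the same relationship can be used in reverse to check a listed hairpin sequence for transcription errors: the part after the loop should read as the reverse complement of the part before it.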

Suitable shRNA sequences for the knock down of a given target gene are well
known in the art (see e.g. the particular shRNA sequences mentioned in Tables
1 and 2 below) or can readily be determined by the skilled artisan.



Table 1:

target gene  shRNA sequence / SEQ ID NO                                 Reference
CDH-1        TgagaagtctcccagtcagTTCAAGAGActgactgggagacttctca (19)       Brummelkamp et al., Science, 296:550-3 (2002).
p53          GactccagtggtaatctacTTCAAGAGAgtagattaccactggagtc (20)       Brummelkamp et al., Science, 296:550-3 (2002).
CDC20        CggcaggactccgggccgaTTCAAGAGAtcggcccggagtcctgccg (21)       Brummelkamp et al., Science, 296:550-3 (2002).
CYLD         CctcatgcagttctctttgTTCAAGAGAcaaagagaactgcatgagg (22)       Kovalenko et al., Nature, 424:801-5 (2003).
Ras-Gap      AagatgaagccactccctatttCAAGAGAaaatagggagtggcttcatctt (23)   Kunath et al., Nature Biotechnology, 21:559-561 (2003).
tubulin      GacagagccaagtggactcACAgagtccacttggctctgtc (24)             Yu et al., PNAS, 99:6047-52 (2002).
lamin        Ctggacttccagaagaacattcgtgttcttctggaagtccag (25)            Paul et al., Nature Biotechnology, 20:505-8 (2002).

Table 2: shRNA sequences known from Brummelkamp et al., Nature, 424:797-801
(2003):

Target 

Gene 

UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 12 

UBIQUITIN • 
CARBOXYL- 
TERMINAL 
HYDROLASE 11 

UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 10 

HAUSP 



UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 8 

FU 10785 



KIAA0710 



FU 12552/ 
FU 14256 



KIAA1203 



shRNA Sequence / SEQ ID NO 

GAGATTGGTCCAGAACAGTTTCAAGAGAACTGTTCTGGACCAATCTC (26) 
G CCCT TCCGATCATGGTAGTTCAAGAGACTACCATGATCGGAAGGGC (27) 
TCTTTAGAATTCTTAAGTATTCAAGAGATACTTAAGAATTCTAAAGA (28) 
CATTAGCTATATCAACATGTTCAAGAGACATGTTGATATAGCTAATG (29) 

ACCACAAACGGCGGAACGATTCAAGAGATCGTTCCGCCGTTTGTGGT (30) 
GAGGGTCTTGGAGGTCTTCTTCAAGAGAGAAGACCTCCAAGACCCTC (31) 
GTCCATGCCCAGCCGTACATTCAAGAGATGTACGGCTGGGCATGGAC (32) 
GCTGGACACCCTCGTGGAGTTCAAGAGACTCCACGAGGGTGTCCAGC (33) 

GAATATCAGAGAATTGAGTTTCAAGAGAACTCAATTCTCTGATATTC (34) 
TGGACTTCATGAGGAAATGTTCAAGAGACATTTCCTCATGAAGTCCA (35) 
TATTGAATATCCTGTGGACTTCAAGAGAGTCCACAGGATATTCAATA (36) 
TTGTACTGAGAGAAACTGCrrCAAGAGAGCAGTTTCTCTCAGTACAA (37) 

. GAT CAATG ATAGGTTTGAATTCAAGAGATTCAAACCTATCATTGATC (38) 
, GGAGTTTGAGAAGTTTAAATTCAAGAGATTTAAACTTCTCAAACTCC (39) 
GAACTCCTCGCTTGCTGAGTTCAAGAGACTCAGCAAGCGAGGAGTTC (40) 
CCGAATTTAACAGAGAGAATTCAAGAGATTCTCTCTGTTAAATTCGG (41) 

GACAGCAGAAGAATGCAGATTCAAGAGATCTGCATTCTTCTGCTGTC (42) 
ATAAAGCTCAACGAGAACCTTCAAGAGAGGTTCTCGTTGAGCTTTAT (43) 
GGTGAAGTGGCAGAAGAATnrCAAGAGAATTCTTCTGCCACTTCACC (44) 
GTATTGCAGTAATCATCACTTCAAGAGAGTGATGATTACTGCAATAC (45) 

GATATGGGGTTCCATGTCATTCAAGAGATGACATGGAACCCCATATC (46) 
GGAGACATGGTTCTTAGTGTTCAAGAGACACTAAGAACCATGTCTCC (47) 
AGCACCAAGTTCGTCTCAGTTCAAGAGACTGAGACGAACTTGGTGCT (48) 
GATGCAACACTGAAAGAACTTCAAGAGA G I I C I I I CAGTGTTGCATC (49) 

GTCAATGGCAGTGATGATATTCAAGAGATATCATCACTGCCATTGAC (50) 
CCTG CTAG CTGCCTGTGGCTTCAAGAGAGCCACAGGCAGCTAGCAGG (51) 
CCACCTTTGCCAGAAGGAGTTCAAGAGACTCCTTCTGGCAAAGGTGG (52) 
CCCTATTGAGGCAAGTGTCTTCAAGAGAGACACTTGCCTCAATAGGG (53) 

GAAGGAAAACTTGCrGACGTTCAAGAGACGTCAGCAAGTTTTCCTTC (54) 
CTCACCTGGGTCCATGAGATTCAAGAGATCTCATGGACCCAGGTGAG (55) 
GCTGTCTTACCGTGTGGTCTTCAAGAGAGACCACACGGTAAGACAGC (56) 
CCTGGACCGCATGTATGACTTCAAGAGAGTCATACATGCGGTCCAGG (57) 

GTCAATGGCAGTGATGATATrCAAGAGATATCATCACTGCCATTGAC (58) 
CCTG CTAG CTGCCTGTGGCTTCAAGAGAGCCACAGGCAGCTAGCAGG (59) 
CCACCTTTGCCAGAAGGAGTTCAAGAGACTCCTTCTGGCAAAGGTGG (60) 
CCCTATTGAGGCAAGTGTCrTCAAGAGAGACACTTGCCTCAATAGGG (61) 
FU23277 GGAAATCCGAATTGCTTGGTTCAAGAGACCAAGCAATTCGGATTTCC (62)

CACATTTCTTCAAGTGTGGTTCAAGAGACCACACTTGAAGAAATGTG (63) 
CAGCAGGATGCTCAAGAATTTCAAGAGAATTCTTGAGCATCCTGCTG (64) 
GCTGAATACCTACATTGGCTTCAAGAGAGCCAATGTAGGTATTCAGC (65) 

FU14914 (similar GGGCTTGTGCCTGGCCTTGTTCAAGAGACAAGGCCAGGCACAAGCCC (66) 
to UBP4) GCCrTGTCCTGCCAAGAAGTTCAAGAGACTTCTTGGCAGGACAAGGC (67) 

GATTGAAGCCAAGGGA ACGTT CAAGAGACGTTCCCTTGGCTTCAATC (68) 
TGGCGCCTGCTCCCCATCTTTCAAGAGAAGATGGGGAGCAGGCGCCA (69) 

UBIQUITIN CAR- GAACCAGCAGGCtCTGTGGTrCAAGAGACCACAGAGCCTGCTGGTTC (70) 
BOXYL-TERMINAL GGAAGCATAAT TATCTG CCTTCAAGA6AGGCAGATAATTATGCTTCC (71) 
HYDROLASE AGAAGAAGATGCTTTTCACrrCAAGAGAGTGAAAAGCATCTTCTTCT (72) 
ISOZYME L5 CTTGCAGAGGAGGAACCCATTCAAGAGATGGGTTCCTCCTCTGCAAG (73) 

UBIQUITIN CAR- GCAAACAATCAGCAATGCCTTCAAGAGAGGCATTGCTGATTGnTGC (74) 
BOXYL-TERMINAL TTGGACrGATTCATGCTATTTCAAGAGAATAGCATGAATCAGTCCAA (75) 
HYDROLASE CTGGCAATTCGTTGATGTATTCAAGAGATACATCAACGAATTGCCAG (76) 
ISOZYME L3 TTAGATGGGCGGAAGCCATTTCAAGAGAATGGCTTCCGCCCATCTAA (77) 

UBIQUITIN CAR- GAGGAGTCTCTGGGCTCGGTrCAAGAGACCGAGCCCAGAGACTCCTC (78) 
BOXYL-TERMINAL GAGCTGAAGGGACAAGAAGTTCAAGAGACTTCTTGTCCCTTCAGCTC (79) 
HYDROLASE TGTCGGGTAGATGACAAGGTTCAAGAGACCTTGTCATCTACCCGACA (80). 
ISOZYME LI CACAGCTGTTCTTCTGTTCTTCAAGAGAGAACAGAAGAACAGCTGTG (81) 

klAA1891 / GTGGAAGCCTTTACAGATCTTCAAGAGAGATCTGTAAAGGCTTCCAC (82) 

FU25263 CAACAGCTGCCTTCATCTGTTCAAGAGACAGATGAAGGCAGCTGTTG (83) 

CCATAGGCAGTCCTCC TAATr CAAGAGATTAGGAGGACTGCCTATGG (84) 
TGTATCACTGCCACrGGTTTTCAAGAGAAACCAGTGGCAGTGATACA (85) 

FU 14528 (similar CATGTTGGGCAGCTGCAGCrtCAAGAGAGCTGCAGCTGCCCAACATG (86) 
to UBP8) CACAACTGGAGACCTGAAGTTCAAGAGACTTCAGGTCrCCAGTTGTG (87) 

GTATGCCrCCAAGAAA GAGT TCAAGAGACT C I I I LI IG GAGGCATAC (88) 
CrrCACAGTACATTTCTCTTTCAAGAGAAGAGAAATGTACTGTGAAG (89) 

U4/U6 TRI SNRNPGTACrnrCAAGGCCGGGGTTTCAAGAGAACCCCGGCCTTGAAAGTAC (90) 
65 kDa protein CTTGGACAAGCAAGCC AAATT CAAGAGATTTGGCTTGCTTGTCCAAG (91) 
GACTATTGTGACTGATGnTTCAAGAGAAACATCAGTCACAATAGTC (92) 
GGAGAACTTTCTGAAGCGCTTCAAGAGAGCGCTTCAGAAAGTTCTCC (93) 

XM_089437 GACGAGAGAAACCT TCACCT TCAAGAGAGGTGAAGGTTTCTCTCGTC (94) 

ACATTATTCTACATTCTTTTTCAAGAGAAAAGAATGTAGAATAATGT (95) 
AGATTCGCAAATGGATGTATTCAAGAGATACATCCATTTGCGAATCT (96) 
CATTCCCACCATGAGTCTGTTCAAGAGACAGACrCATGGTGGGAATG (97) 

KIAA1453 GATCGCCCGACACTTCCGCTTCAAGAGAGCGGAAGTiSTCGGGCGATC (98) 

CCAGCAGGCCTACGTGCTGTTCAAGAGACAGCACGTAGGCCTGCTGG (99) 
. GCCAGCTCCTCCACAGCAOTCAAGAGAGTGCTGTGGAGGAGCTGGC (100) 
CGCCGCCAAGTGGAGCAGATTCAAGAGATCTGCTCCACTTGGCGGCG (101) 
FU 12697



GAAGATGCCCATGAATTCCTTCAAGAGAGGAATTCATGGGCATCTTC (102) 
CAAACAGGCTGCGCCAGGCTTCAAGAGAGCCTGGCGCAGCCTGTTTG (103) 
ACGGCCTAGCGCCTGATGGTTCAAGAGACCATCAGGCGCTAGGCCGT (104) 
CTGTAACCTCTCTGATCGGTTCAAGAGACCGATCAGAGAGGTTACAG (105) 



UBIQUITIN 
SPECinC . 
PROTEASE 18 
(USP18) 



TCTGTCAGTCCATCCTGGCTTCAAGAGAGCCAGGATGGACTGACAGA (106) 
TGAAGCGAGAGTCTTGTGATTCAAGAGATCACTkAGACTCTCGCTTCA (107) 
GATGGAGTGCTAATGGAAATTCAAGAGATTTCCATTAGCACTCCATC (108) 
CCTTCAGAGATTGACACGCTTCAAGASAGCGTGTCAATCTCTGAAGG (109) 



UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 20 

UBIQUITIN. 
CARBOXYL- 
TERMINAL 
HYDROLASE 24 



CCTGACCACGTTCCGACTGTTCAAGAGACAGTCGGAACGTGGTCAGG (110) 
GAGTTCCnrCGCTGCCT GATr CAAGAGATCAGGCAGCGAAGGAACTC (111) 
GACTGCCTTGCTGCC; I ICI I I CAAGAGAAGAAGGCAGCAAGGCAGTC (112) 
CGCCGAGGGCTACGTACTCrTCAAGAGAGAGTACGTAGCCCTCGGCG (113) 

GGCGAGAAGAAAGGACTGTTTCAAGAGAACAGTC CI I IC I IC TCGCC (114) 
GGACGAGAATTGATAAAGATTCAAG AGATCT7TATCAATTCTCGTCC (115) 
GCACGAGAATTTGGGAATCTTCAAGAGAGATTCCCAAATTCTCGTGC (116) 
CTACTTCATGAAATATTGGTTCAAGAGACCAATATTTCATGAAGTAG ( 1 17) 



KIAA1594 GATAACAGCTTCrrGTCTATTCAAGAGATAGACAAGAAGCTGTTATC (118) 

GAGAATAGGACATCAGGGCTTCAAGAGAGCCCTGATGTCCTATTCTC (119) 
CTTGGA AGAC TGAACCTGTTTCAAGAGAACAGGTTCAGTCTTCCAAG (120) 
CAACrCCTTTGTGGATGCATTCAAGAGATGCATCCACAAAGGAGTTG (121) 



KIAA1350 



UBIQUITIN 
CARBOXYL- 
TERMINAL . 
HYDROLASE 25 



GATGTTGTCrCCAAATGCATTCAAGAGATGCATTTGGAGACAACATC (122) 
CGTGGGGACTGTACCTCCCTTCAAGAGAGGGAGGTACAGTCCCCACG (123) 
GTACAGCrrCT^GAACCAAGTTCAAGAGACTTGGTTCTGAAGCTGTAC (124) 

GATGATCTTCAGAGAGCAATTCAAGAGATTGCTCTCTGAAGATCATC (125) 
GGAACATCGGAATTTGCCnTCAAGAGAAGGCAAATTCCGATGTTCC (126) 
GAGCTAGTGAGGGACTCrrrTCAAGAGAAAGAGTCCCTCACrAGCTC (127) 
GCAGGGTTCTTTAAGGCAATTCAAGAGATTGCCTTAAAGAACCCTGC (128) 



UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 16 



TCGATGATTCCrCTGAAACTTCAAGAGAGTTTCAGAGGAATCATCdA ( 1 29) 
GATAATGGAAATATTGAACTrCAAGAGAGTTCAATATTTCCATTATC (130) 
GTrCTTCATTTAAATGATATTCAAGAGATATCATrTAAATGAAGAAC (131) 
GTTAACAAACACATAAAGTTTCAAGAGAACTTTATGTGTTTGTTAAC (132) 



USP9X 



USP9Y 



GrrAGAGAAGATTCTTCGTrTCAAGAGAACGAAGAATCTTCTCTAAC (133) 
GTTGATTGGACAATTAAACTTCAAGAGAGTTTAATTGTCCAATCAAC ( 1 34) 
GGTTGAtACCGTAAAGCGCTTCAAGAGAGCGCrTTACGGTATCAACC (135) 
GCAATGAAACGTCCAATGGTTCAAGAGACCATTGGACGrrTCATTGC (136) 

AGCTAGAGAAAATTCTTCGTTCAAGAGACGAAGAATnTCTCTAGCT(137) 
GATCCTATATGATGGATGATTCAAGAGATCATCCATCATATAGGATC (138) 
GT I CI I CI I GTCAGTGAAATTCAAGAGATTTCACTGACAAGAAGAAC (139) 
CrTGAGCTTGAGTGACCACTTCAAGAGAGTGGTCACTCAAGCTCAAG (140) 
UBIQUITIN
CARBOXYL- 
TERMINAL 
HYDROLASE 5 



GACCGGCCAGCGAGTCTACTTCAAGAGAGTAGACrCGCTGGCCGGTC (141) 
GGACCTGGGCTACATCTACTTCAAGAGAGTAGATGTAGCCCAGGTCC ( 142) 
CTCTGTGGTCCAGGTGCrCTTCAAGAGAGAGCACCTGGACCACAGAG (143) 
GACCACACGATTTGCCTCATTCAAGAGATGAGGCAAATCGTGTGGTC ( 144) 



UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 26 



TGGCTTGTTTATTGAAGGATTCAAGAGATCCTTCAATAAACAAGCCA (145) 
GTGAATTTGGGGAAG ATAATTCAAG AGATTATCTTCCCCAAATTCAC ( 146) 
CGCTATAGCTTGAATGAGTTTCAAGAGAACTCATTCAAGCTATAGCG (147) 
GATATCCTGGCTCCACACATTCAAGAGATGTGTGGAGCCAGGATATC ( 148) 



KIAA1097 



GAGCCAGTCGGATGTAGATTTCAAGAGAATCTACATCCGACTGGCTC ( 149) 
GTAAATTCTGAAGGCGAATTTCAAGAGAATTCGCCTTCAGAATTTAC (150) 
GCCCTCCTAAATCAGGCAATTCAAGAG ATTGCCTGATTTAGG AGGGC (151) 
GTTGAGAAATGGAGTGAAGTTCTVAGAGACTTCACTCCATTTCTCAAC (152) 



UBIQUITIN 
SPECIFIC 
PROTEASE 22 
(USP22) 



GCTTGG AAAATGCAAGGCGrrCAAGAGACGCCTTGCATTTTCCAAGC (153) 
CTGCATCATAGACCAGATCTTCAAGAGAGATCTGGTCTATGATGCAG (154) 
GATCACCACGTATGTGTCCTTCAAGAGAGGACACATACGTGGTGATC (155) 
TGACAACAAGTATTCCCTGtTCAAGAGACAGGGAATACI IGI IGTCA ( 1 56) 



UBIQUITIN- 
SPECinC 
PROCESSING 
PROTEASE 29 

CYLD 



GAAATATAAGACAG ATTCCrrCAAGAGAGGAATCTGTCTTATATTTC (157) 
CCCATCAAGTTTAGAGGATTTCAAGAGAATCCTCTAAACTTGATGGG (158) 
GGTGTCCCATGGGAATATATTCAAGAGATATATTCCCATGGGACACC (159) 
GAATGCCGACCTACAAAGATrCAAGAGATCTTTGTAGGTCGGCATTC (160) 

ft 

CAGTTATATTCTGTGATGTTTCAAGAGAACATCACAGAATATAACTG (161) 
GAGGTGTTGGGGACAAAGGTrCAAGAGACCTTTGTCCCCAACACCTC (162) 
GTGGGCrCATTGGCTGAAGTrCAAGAGAClTCAGCCAATGAGCCCAC (163) 
GAGCTACTGAGGACAGAAATTCAAGAGATTTCTGTCCTCAGTAGCTC (164) 



UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 2 



TCAGCAGGATGCTCAGGAGTTCAAGAGACTCCTGAGCATCCTGCTGA ( 1 65) 
GAAGTTCTCCATCCAGAGGTTCAAGAGACCTCTGGATGGAGAACTTC (166) 
GCCGGTCCCCACCAGCAGCTTCAAGAGAGCTGCTGGTGGGGACCGGC (167) 
CAGTCGGGAGTTGAGAGATTTCAAGAGAATCTCrCAACTCCCGAGTG (168) 



UBIQUITIN 
SPECIFIC 
PROTEASE 3 
(USP3) 

UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 23 



GCCCTTGGGTCTGTTTGACTTCAAGAGAGTCAAACAGACCCAAGGGC (169) 
CTCAACACTAAACAGCAAGTTCAAGAGACTTGCTGTTTAGTGTTGAG (170) 
GATTTCATTGGACAGCATATTCAAGAGATATGCTGTCCAATGAAATC (171) 
CATGGGGCACCAACTAATTTTCAAGAGAAATTAGTTGGTGCCCCATG (172) 

GGTGTCTCTGCGGGATTGTTTCAAGAGAACAATCCCGCAGAGACACC (173) 
. AGTTCAGTAGGTGTAGACTTTCAAGAGAAGTCTACACCTACTGAACT (174) 
GAGTTCCTGAAGCTCCTCATTCAAGAGATGAGGAGCTTCAGGAACTC (175) 
GGATnrGCTGGGGGCAAGGTTCAAGAGACCTTGCCCCCAGCAAATCC (176) 



UBP-32.7 



CTCAGAAAGCCAACATTCATTCAAGAGATGAATGTTGGCrTTCTGAG (177) 
CGCATTGTAATAAGAAGGTTTCAAGAGAACCTTCTTATTACAATGCG (178) 
GGGAGGAAAATGCAGAAATTTCAAGAGAATTTCTGCATTTTCCTCCC (179) 
TTACAAATTTAGGAAATACTrCAAGAGAGTATTTCCTAAATTTGTAA (180) 
HOMO SAPIENS
UBIQUITIN SPE- 
aRC PROTEASE 
13 (ISOPEP- 
TIDASE T-3) 

UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 28 

UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 14 

DUBl 



MOUSE USP27 
HOMOLOG 



UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 4 



GTTATGAATTGATATGCAGTTCAAG AGACTGCATATCAATTCATAAC (181) 
GTGATAACACAACTAATGGTTCAAGAGACCATTAGTTGTGTTATCAC (182) 
GTAGAGGAGAGTTCTGAAATTCAAGAGATTTCAGAACTCTCCTCTAC (183) 
GCCTCTAATCCTGATAAGGTTCAAGAGACCTTATCAGGATTAGAGGC ( 1 84) 

GATGATCTTCAGGCTGCCATTCAAGAGATGGCAGCCTGAAGATCATC (185) 
GTATGGACAAG AGCGTTGGTTCAAG AG ACCAACGCTCTTGTCCATAC (186) 
CGAACCCTTCTGGAACA GTTT CAAGAGAACTGTTCCAGAAGGGTTCG ( 187) 
GTGGCATGAAGATTATAGTTTCAAGAGAACTATAATCTTCATGCCAC (188). 

GGTGAACAAGGACTVGTATCTTCAAGAGAGATACTGTCCTTGTTCACC ( 1 89) 
GCAATAGAGGATGATTCTGTTCAAGAGACAGAATCATCCTCTATTGC (190) 
TCTGTGAATGCCAAAGTTCrrCAAGAGAGAACnTGGCATTCACAGA (191) 
CACACCAGGGAAGGTCTAGTTCAAGAGACTAGACCTTCCCTGGTGTG (192) 

GCAGGAAGATGCCCATGAATTCAAGAGATTCATGGGCATCTTCCTGC (193) 
GAATGTGCAATATCCTGAGTTCAAGAGACTCAGGATATTGCACATTC (194) 
TGGATGATGCCAAGGTCACTTCAAGAGAGTG ACCTTGGCATCATCCA (195) 
GCTCCGTGCTAAACCTCTCTTCAAGAGAGAGAGGTTTAGCACGGAGC (196) 

GCCTCCACCTCAACAGAGGTTCAAGAGACCrCTGTTGAGGTGGAGGC (197) 
CTGCATCATAG ACCAAATCTTCAAG AGAGATTTGGTCTATGATGCAG ( 1 98) 
GATCACTACATACATTTCCTTCAAGAGAGGAAATGTATGTAGTGATC (199) 
GTAAAGAGAGCAGAATGAATTCAAGAGATTCATTCTGCTCTCTTTAC (200) 

CGCGGGGCGCAGTGGTATCTTCAAGAGAGATACCACTGCGCCCCGCG (201) 
CAGAAGGQVGTGGGGAAGATTQ^AGAGATCTTCCCO^CTGCCTTCTG (202) 
GCCTGGGAGAATCACAGGTTTCAAGAGAACCTGTGATTCTCCCAGGC (203) 
ACCAGACAAGGAAATACCCTTCAAGAGAGGGTATTTCCTTGTCTGGT (204) 



TRE-2 CACATCCACCACATCGACCTTCAAGAGAGGTCGATGTGGTGGATGTG (205) 

GTCACAACCCAAGACCATGTTCAAGAGACATGGTCTTGGGTTGTGAC (206) 
CTCAACAGGACAAATCCCATTCAAGAGATGGGATTTGTCCTGTTGAG (207) 
TAGATCAATTATTGTGGATTTCAAGAGAATCCACAATAATTGATCTA (208) 

UBIQUITIN CAR- GGAA CACCTTATTGATGAATTCAAGAGATTCATCAATAAGGTGTTCC (209) 
BOXYL-TERMINAL OTrAACAGAAATTGTCTCTTCAAGAGAGAGACAATTrCTGTTAAAG (210) 
HYDROLASE 15 CCT ATGCA GTACAAAGTGGTTCAAGAGACCACTTTGTACTGCATAGG (211) 
(UNPH-2). GATCI I I iCI IGCTTTGGATTCAAGAGATCCAAAGCAAGAAAAGATC (212) 



KIAA1372 



BRCAl 

ASSOCIATED 
PROTEIN -1 



CAGCATCCTTCAGGCCTTATTCAAGAGATAAGGCCTGAAGGATGCTG (213) 
GATAGTGACTCGGATCTGCTTCAAGAGAGCAGATCCGAGTCACTATC (214) 
GACATCACAGCCCGGGAGTTTCAAGAGAACTCCCGGGCTGTGATGTC (215) 
GGACACAGCCTATGTGCTGTTCAAGAGACAGCACATAGGCTGTGTCC (216) 

GTGGAGGAGATCTACGACCTTCAAGAGAGGTCGTAGATCTCCTCCAC (217) 
CrCTTGTGCAACTCATGCCTTCAAGAGAGGCATGAGTTGCACAAGAG (218) 
ACAGGGCCCCTGCAGCCTCTTCAAGAGAGAGGCTGCAGGGGCCCTGT (219) 
GAAGACCTGGCGGCAGGTGTTCAAGAGACACCTGCCGCCAGGTCTTC (220) 

A preferred embodiment of the method (2) of the invention concerns the
following steps:

1. Generation of the short hairpin DNA containing the antisense- and
sense-strand of the coding region of a gene (e.g. firefly luciferase; p53).
Antisense and sense-strand are separated by a spacer of 5 to 9 bp.

2. Generation of constructs for the expression of the above mentioned
shRNA under the control of a constitutive or inducible promoter (Pol II
or Pol III dependent).

3. Insertion of the mentioned expression constructs into a ubiquitously
expressed Pol II dependent locus by homologous recombination in ES
cells.

4. Analysis of the constitutive and inducible inhibition of gene expression
(e.g. firefly luciferase; p53) in ES cells (e.g. through Western blot
analysis).

5. Generation of mice using the mentioned ES cells and analysis of the
inhibition of gene expression in several tissues (e.g. firefly luciferase;
p53; e.g. through Western blot analysis).

The vector according to embodiment (4) of the invention is suitable for stable
or transient integration. Said vector is suitable for gene transfer.

The technology of the present application provides for the following
advantages:

(i) A stable and body wide inhibition of gene expression by generating
transgenic animals (such as mice).

(ii) A reversible inhibition of gene expression using the inducible constructs.


The invention Is furthermore described by the following examples which are, 
however, not to be construed so as to limit the Invention. 
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Examples 

Methods 

Plasmid construction: All plasmid constructs were generated by standard DNA 
cloning methods. The U6- and the Hl-promoter fragrrients were amplified from 
human genomic DNA. The Renilla luciferase (Rluc) and firefly luciferase (Flue) 
coding regions (Promega) were placed 3' of the adenovirus splice acceptor site 
(Friedrich and Soriano, Genes Dev. 9:1513-23 (1991); see SEQ ID NOs: 3, 5 
and 7) to facilitate transcription from the endogenous rosa26 promoter. For 
homologous recombination in embryonic stem cells, the expression cassettes 
were flanl<ed by homology regions for the Rosa26 locus (see SEQ ID NO:. 2). 

Cell culture: Culture and targeted mutagenesis of F1 (C57BL/6 x 129Sv/Ev) ES
cells were carried out as previously described (Hogan, B., Beddington, R.,
Costantini, F. and Lacy, E. (1994). A Laboratory Manual. In Manipulating the
Mouse Embryo. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor
NY.), pp. 253-289.). For Cre-mediated deletion, ES cells were transiently
transfected with 5 µg of the cre expression plasmid pCAGGScre (SEQ ID NO:
18). 24 h after transfection, 1000 ES cells were plated on a 10 cm dish.
Individual clones were isolated and analyzed for Cre-mediated recombination
by Southern blot using standard protocols.

For induction, cells were cultivated in cell culture medium containing 1 µg/ml
doxycycline (Sigma) for 2 days.

For transient transfection of cells, mixtures of 350 ng supercoiled plasmid DNA
pre-complexed with the FuGene6 transfection reagent (Roche Diagnostics
GmbH, Mannheim, Germany) were added to the cells in 250 µl medium,
according to the manufacturer's protocol. Each plasmid mixture contained 50
ng of the beta-galactosidase expression vector CMV-lacZ (SEQ ID NO: 17),
100 ng of CAGGS-Fluc (SEQ ID NO: 15), 100 ng of the tet-inducible H1-shRNA
vector (SEQ ID NO: 11, 12 or 13), and pUC19 to a total amount of 350 ng
DNA. For each sample to be tested, four individual wells were transfected. One
day after the addition of the DNA preparations, the cell culture medium was
exchanged.
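The composition of the plasmid mixture described above implies the amount of pUC19 filler DNA. The short Python sketch below works through that arithmetic. The dictionary keys and the function name are chosen for this illustration only and do not appear in the protocol.

```python
TOTAL_NG = 350  # total DNA per transfection mixture, as stated in the text

# Fixed components named in the text (ng per mixture).
FIXED_COMPONENTS_NG = {
    "CMV-lacZ": 50,
    "CAGGS-Fluc": 100,
    "tet-inducible H1-shRNA vector": 100,
}


def filler_amount_ng(total_ng: int, components_ng: dict) -> int:
    """Amount of filler DNA needed to reach the stated total."""
    used = sum(components_ng.values())
    if used > total_ng:
        raise ValueError("components exceed the total DNA amount")
    return total_ng - used


puc19_ng = filler_amount_ng(TOTAL_NG, FIXED_COMPONENTS_NG)
print(puc19_ng)  # 100
```

With the stated amounts, each mixture therefore contains 100 ng of pUC19.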

Luciferase measurement: Stably transfected cells: 25,000 cells were plated on
a 24-well plate and analyzed 48 h later. Luciferase activity was detected using
the Dual-Glo™ Luciferase Assay System (Promega) and a Lumat LB 9507
luminometer (EG&G Berthold) according to the manufacturers' protocols.
Transiently transfected cells: The cells were lysed 48 hours after transfection
with 100 µl/well of lysate reagent supplemented with protease inhibitors
(β-galactosidase reporter gene assay kit, Roche Diagnostics). The lysates were
centrifuged and 20 µl were used to determine the β-galactosidase activities
using the β-galactosidase reporter gene assay (Roche Diagnostics) according
to the manufacturer's protocol in a Lumat LB 9507 luminometer (Berthold). To
measure luciferase activity, 20 µl of lysate was diluted into 250 µl assay buffer
(50 mM glycylglycine, 5 mM MgCl2, 5 mM ATP) and the relative light units (RLU)
were counted in a Lumat LB 9507 luminometer after addition of 100 µl of a 1
mM luciferin (Roche Diagnostics) solution.
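The volumes given for the firefly luciferase measurement (20 µl lysate, 250 µl assay buffer, 100 µl of 1 mM luciferin) fix the final reaction volume and the final luciferin concentration. The following Python sketch is a worked calculation under the assumption that the three volumes are simply additive; the variable names are illustrative.

```python
LYSATE_UL = 20.0           # lysate volume per measurement
ASSAY_BUFFER_UL = 250.0    # assay buffer volume
LUCIFERIN_UL = 100.0       # volume of luciferin solution added
LUCIFERIN_STOCK_MM = 1.0   # concentration of the luciferin solution

# Assumption: volumes are additive.
final_volume_ul = LYSATE_UL + ASSAY_BUFFER_UL + LUCIFERIN_UL

# Final luciferin concentration by simple dilution (C1 * V1 = C2 * V2).
final_luciferin_mm = LUCIFERIN_STOCK_MM * LUCIFERIN_UL / final_volume_ul

# Overall dilution of the lysate in the final reaction.
lysate_dilution = final_volume_ul / LYSATE_UL

print(final_volume_ul)               # 370.0
print(round(final_luciferin_mm, 3))  # 0.27
print(lysate_dilution)               # 18.5
```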

Generation of chimeric mice: Recombinant ES cells were injected into
blastocysts from Balb/C mice and chimeric mice were obtained upon transfer
of blastocysts into pseudopregnant females using standard protocols (Hogan et
al. Manipulating the Mouse Embryo: A Laboratory Manual. Cold Spring Harbor
Laboratory Press, Cold Spring Harbor NY. 253-289 (1994)).

Luciferase measurement in organs: Organs were homogenized at 4 °C in lysis
buffer (0.1 M KH2PO4, 1 mM DTT, 0.1% Triton® X-100) using a tissue grinder.
The homogenates were centrifuged for 5 min at 2000 x g (4 °C) to pellet debris,
and the supernatant was assayed for luciferase activities using the Dual
Luciferase Assay (Promega, Inc.) according to the manufacturer's protocol.

Example 1

The firefly luciferase gene along with a splice acceptor sequence (Friedrich and
Soriano, Genes Dev. 9:1513-23 (1991)) was inserted into the first allele of the
rosa26 locus by homologous recombination in ES cells. The shRNA expression
cassettes under the control of the H1 or the U6 promoter and a Renilla
luciferase gene were inserted into the second allele of the rosa26 locus by
homologous recombination. The Renilla luciferase gene was used as a
reference to normalize the values of firefly luciferase activity.

Deleting the loxP-flanked shRNA expression cassette by transient transfection
of a cre expression plasmid generated the negative control. Figure 7 shows the
expression of the firefly luciferase in the absence and the presence of the
shRNA expression cassette. Expression of the shRNA under the control of the
H1 promoter resulted in a ~75% reduction of firefly luciferase activity,
whereas the repression mediated by the U6-shRNA construct was ~60%.
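The normalization and the percent reduction reported above can be sketched as follows. This is a minimal Python illustration, assuming that the firefly signal is divided by the Renilla signal of the same sample and that the Cre-deleted control defines 100% activity. The RLU values are invented for the illustration and are not measured data from this example.

```python
def normalized_activity(firefly_rlu: float, renilla_rlu: float) -> float:
    """Firefly luciferase signal relative to the Renilla reference."""
    if renilla_rlu <= 0:
        raise ValueError("Renilla reference signal must be positive")
    return firefly_rlu / renilla_rlu


def percent_reduction(control_ratio: float, shrna_ratio: float) -> float:
    """Reduction of normalized activity relative to the control, in percent."""
    return 100.0 * (1.0 - shrna_ratio / control_ratio)


# Hypothetical readings (RLU), chosen only to illustrate the arithmetic.
control = normalized_activity(firefly_rlu=8000.0, renilla_rlu=2000.0)
with_shrna = normalized_activity(firefly_rlu=2000.0, renilla_rlu=2000.0)

print(percent_reduction(control, with_shrna))  # 75.0
```

Dividing by the Renilla signal compensates for differences in cell number and lysis efficiency between samples, so the reduction reflects the effect of the shRNA rather than sample handling.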

Example 2 

The shRNA expression cassette under the control of the U6 promoter
containing tet-operator sequences, and a Renilla luciferase gene were inserted
into the first allele of the rosa26 locus (figures 8 and 12; SEQ ID NOs: 9 and
10). The firefly luciferase gene under the control of the endogenous rosa26
promoter and a tet repressor expression cassette (Gossen and Bujard, PNAS
89:5547-5551 (1992)) were inserted into the second allele of rosa26 by
homologous recombination in ES cells (figure 8; SEQ ID NO: 14). To determine
the extent of shRNA-mediated gene silencing, the cells were cultured for 2 days
in the presence or in the absence of 1 µg/ml doxycycline in the cell culture
medium. Figure 10 shows the firefly luciferase activity measured in the cell
extracts. Doxycycline-inducible expression of the shRNA under the control of
the U6tetO1 promoter (SEQ ID NO: 9) resulted in a ~30% reduction of firefly
luciferase activity, whereas repression mediated by the U6tetO2 construct (SEQ
ID NO: 10) reached ~50%.

Example 3

NIH3T3 cells were transiently transfected with constructs expressing the
luciferase and tet repressor genes (SEQ ID NOs: 15 and 16) together with the
shRNA constructs under the control of the H1 promoter containing tet-operator
sequences (SEQ ID NOs: 11, 12 and 13; figure 9). Figure 11 shows the
expression of firefly luciferase in the absence and in the presence of 1 µg/ml
doxycycline. The highest degree of doxycycline-inducible shRNA expression
was achieved with the H1tetO2/3' promoter, resulting in a ~80% reduction of
firefly luciferase activity.

Example 4 

To examine the activity of the Rosa26/U6- and H1-shRNA transgenes in vivo,
recombinant ES cells were injected into diploid blastocysts and chimeric mice
were obtained upon transfer of blastocysts into pseudopregnant females.
Figure 13 shows the activity of firefly luciferase in different organs in the
presence and absence of the shRNA expression cassettes. The Renilla
luciferase gene at the second Rosa26 allele served as a reference to normalize
the values of firefly luciferase activity. Expression of the shRNA under the
control of both the U6 and the H1 promoter resulted in efficient
repression of firefly luciferase activity in most organs (Fig. 13A), with the
highest degree of reduction (>90%) in liver, heart, brain and muscle (Fig.
13B).

CLAIMS:

1. Use of an expression vector comprising a short hairpin RNA construct under
control of a ubiquitous promoter for preparing an agent (shRNA) for
constitutive and/or inducible gene knock down in a vertebrate, or in tissue
culture or cells of a cell culture derived from a vertebrate.

2. The use of claim 1, wherein the expression vector

(i) is suitable for stable integration into the genome of a vertebrate, or into
the genome of the tissue culture or of cells of the cell culture; and/or

(ii) contains homologous sequences suitable for integration at a defined
genomic locus through homologous recombination, preferably at a polymerase
II dependent locus in the genome of the vertebrate, in the genome of the
tissue culture or in the genome of the cells of the cell culture including
embryonic cells; and/or

(iii) further contains functional sequences selected from splice acceptor
sequences, polyadenylation sites, selectable marker sequences, etc.

3. The use of claim 2, wherein the polymerase II dependent locus is selected 
from a Rosa26, collagen, RNA polymerase, actin and HPRT locus. 

4. The use according to any one of claims 1 to 3, wherein

(i) the ubiquitous promoter is selected from polymerase I, II and III
dependent promoters, preferably is a polymerase II or III dependent
promoter, most preferably is a CMV promoter, a CAGGS promoter, a snRNA
promoter such as U6, an RNase P RNA promoter such as H1, a tRNA promoter,
a 7SL RNA promoter, a 5S rRNA promoter, etc.; and/or

(ii) the ubiquitous promoter is a constitutive promoter, or is an inducible
promoter, preferably a promoter containing an operator sequence selected
from tet, Gal4, lac, etc.; and/or

(iii) said vertebrate is a non-human vertebrate, preferably is a mouse or fish.

5. The use according to any one of claims 1 to 4, wherein the expression
vector is

(a) a Pol III dependent promoter, preferably constitutive H1 or U6, driven
shRNA construct suitable to be integrated into a ubiquitously active Pol
II dependent locus;

(b) a Pol III dependent promoter, preferably inducible U6 or H1, driven
shRNA construct suitable to be integrated into a ubiquitously active Pol
II dependent locus; or

(c) a Pol II dependent promoter, preferably inducible CMV, driven shRNA
construct suitable to be integrated into a ubiquitously active Pol II
dependent locus.

6. The use according to any one of claims 1 to 5, wherein the shRNA
comprises

(I) at least one DNA segment A-B-C wherein

A is a 15 to 35, preferably 19 to 29 bp DNA sequence with at least 95%,
preferably 100% complementarity to the gene to be knocked down;

B is a spacer DNA sequence having 5 to 9 bp forming the loop of the
expressed RNA hairpin molecule, and

C is a 15 to 35, preferably 19 to 29 bp DNA sequence with at least 85%
complementarity to the sequence A; and

(II) a stop and/or polyadenylation sequence.

7. A method for constitutive and/or inducible gene knock down in a vertebrate,
or in a tissue culture or cells of a cell culture derived from a vertebrate, which
comprises stably integrating an expression vector as defined in any one of
claims 1 to 6 into the genome of the vertebrate, of the tissue culture or of the
cells of the cell culture.

8. The method of claim 7, wherein said vertebrate is a non-human vertebrate,
preferably is a mouse or fish.

9. The method of claim 7 or 8, wherein the expression vector is integrated at a
polymerase dependent locus of the living organism, tissue culture or cell
culture.

10. The method according to any one of claims 7 to 9, wherein the method for
constitutive and/or inducible gene knock down in a vertebrate comprises
integrating the expression vector into ES cells of the vertebrate.

11. A vertebrate, or tissue or cell culture derived from a vertebrate having 
stably integrated, preferably at a polymerase II dependent locus of the 
vertebrate, tissue culture or cells of the cell culture, an expression vector as 
defined in any one of claims 1 to 6. 

12. The vertebrate, tissue or cell culture of claim 11, which is or is derived from
a non-human vertebrate, preferably mouse or fish.

13. An expression vector as defined in any one of claims 1 to 6.

<110> ARTEMIS Pharmaceuticals GmbH

<120> siRNA mediated gene silencing in transgenic animals

<130> 032282WO JH/BM/short

<140>
<141>

<160> 18

<170> PatentIn Ver. 2.1

<210> 1
<211> 13139
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Rosa26 locus
sequence

<400> 1
aagcttctca 
actcctgtga 
cttttttttg 
tttagaaagg 
aaataccaaa 
ttttagtttt 
aggaacaaga 
caatctgaat 
ctaatgatcc 
aaactcagaa 
acatctaaaa 
attaaccatc 
ctccatctgg 
tgtaaacatg 
gaaattcgag 
atccaggtat 
agacgttaca 
* caggccagca 
. tgagaaaggc 
aaaactttcc 
ctaggatctt 
gttcttaaaa 
ccaggctaga 
agggagtcca 
agcagccatc 
ggggaggact 
gtggggggct 
tctcctgaaa 
gtagggggag 
agatgaagga 
agaagacgaa 
cccgaggaga 
tgtggttcgg 
gggcccgggc 
gcggcgccac 
cagaaaaggc 
actactgtgt 
ggcggccgcc 



cgtagcaacc 
tccaacacag 
gggggggggg 
aacaatccta 
ttccaaatat 
aaaafeggtca 
gccttgggtc 
aaccttaaag 
tcctgagaaa 
atctctaatt 
taacctctta 
aaaacacttt 
atcctcccca 
accttggtca 
aacctgaatc 
agacaaaacc 
caagtaacat 
gggcgggctg 
aaaggcgccc 
ttctgagcga 
ggcttgcacg 
gatcgctctc 
accctacaaa 
agaatgtgag 
tgagatagga 
gggcccccac 
cagggacaga 
agggtataaa 
gggaagagtc 
gagcctttct 
aagggcaagc 
gggaacgcag 
tgtctctttt 
tgcggcgcac 
ggcgtttcct 
ggagcgagcc 
tggcggactg 
cctcccccgg 



agagctccag 
gagcaacctt 
gggaacaagt 
aggtcacttt 
tcgttttaat 
attttccaaa 
atgtcctacc 
cctctaatcc 
actggaagaa 
atgtcagtat 
aagttttcct 
cctctcaata 
atcaaaagta 
caaatatgtc 
tttttaagta 
cagagcccag 
gagaaagcag 
cagcatcagt 
gatagaataa 
ttacctacta 
gattcatagg 
cacgccctag 
ttcaacaggg 
gtgggggtgg 
actggaaaac 
gagcgaccag 
aaaaaaagta 
cgtggagtag 
ctgacccagg 
ctctgggcaa 
atcttcctgc 
ggagactgag 
ctgttggacc 
ggcactcccg 
ggccgggaat 
cgagcgggga 
gcgggactag 
cggcggcagc 



agccagcagc 

ttctttaccc 

gctccatgct 

taaattgagg 

taaaccagca 

cattcagcaa 

aaagaacata 

cataacaggc 

atccagggaa 

tctctgcttt 

cctagcgacc 

tgctgcacac 

taggtattta 

atctaaaaac 

ttttgagcac 

agctctgggc 

aaaatgcagg 

acacaggaga 

attagcatag 

aaaccagggc 

ggcatatccc 

gcagggaaaa 

atatcgcaag 

cgaaggtaat 

cagaggagag 

agttgtcaca 

tgtgtatttt 

gcaataccca 

gaagacatta 

gagcggtgca 

taccaggctg 

gtgacccttc 

cttaccttga 

ggaggcagcg 

ggcccgtacc 

gggggagggc. 

ggctgcgtga 

ggcggcagcg 



tgctgccgcc 
cacccccact 
. ggaaggattg 
tctttgattt 
atgtggatat 
tcatatttaa 
actcaatatt 
cacaaatttt 
aagaaattcc 
agtcctaggt 
taaaccatta 
aaacctcctc 
acatataagc 
aatttagtca 
aggaacaatt 
gaaaaatgag 
tcatccacgc 
aagatcctta 
aaggggcttt 
ttttgcccac 
tccccctctt 
cgacaaaatc 
gatactgggg 
gtctttggtg 
gcgttcagga 
aggccgcaag 
gagagcaggg 
ggcaaaaagg 
aaaaggtagt 
atggtgtgta 
gggaggccca 
tttcccccgg 
cccaggcgct 
agactcgagt 
cgtgaggtgg 
caggggcgga 
gtctctgagc 
gcggcagctc 



ttgtatactc 

tcttaacaca 

gaactatgct 

gaaaatcaac 

aagcattaag 

atttacagct 

ctacacatga 

ggacagagaa 

tgtgtcctcc 

cagattgcac 

ttaatatcaa 

ctggaacctc 

aaggaagtaa 

aggtatggag 

ggcaaaagga 

ttgctggtga 

acccctgacc 

ttcctaagaa 

cccaggagtt 

taccatttac 

ctttagagtc 

tggctcaatt 

catacgccac 

tgggaaaagc. 

agattatgga 

aacaggggag 

ttgggaggcc 

ggagaccaga 

ggggtcgact 

aaggtagctg 

ggcccacgac 

ggcccggtcg 

gccggggcct 

taggcccaac 

gggtgggggg 

gggggccggc 

gcaggcgggc 

actcagcccg 



60
1080
ctgcccgagc
gcgggacacg 
ggccggtgcg 
tggggctccg 
gcggcttggt 
cgttctgtga 
aggaatgcgg 
caccggacgc 
tctcgtcgct 
tggttggcgt 
cgctctgccc 
gcggtcggcc 
cgagcggccg 
cctcgtcgtc 
. cgtgcaagtt 
ggccctcccc 
' gaagcgcgcg 
gcacgtttcc 
ctccggggag 
cttgctctcc 
gggagaaggc 
gttctctgct 
tcttccctcg 
ggcttaaagg 
attggaggga 
caaataagga 
ttattattgc 
cacccgagtt 
cgagttagac 
atttctcccg 
cattttcatt 
tcctgatctt 
atcgaggctg 
ttgttgatcc 
gttccacatt 
tccaacctaa 
aaactgagct 
gtatcttaca 
gccacatggt 
ggttcctttt 
agcttgtatt 
ctacacgtta 
gcttgacttg 
ggtttttagg 
gagaggtggc 
ccagcaacca 
tgtctgaaga 
ttttttttta 
gtttgttcca 
aggaactaat 
gggatacaga 
ataaaaattg 
gcttcatcta 
aatgataatt 
tgcctttatg 
agcccagcct 
cctgttgggg 
gccactatgg 
ctgtctctgc 
gccatgtggt 
ttctaagggc 
caaacactaa 
taaaatagtt 



ggaaaicgcca 
ccccctcccg 
ccgccaatca 
gctcctcaga 
gcgtttgcgg 
gacagccggg 
tccgccctgc 
ggccatggct 
gattggcttc 
aaggcgcctg 
actgggtggg 
tctggcgggg 
cccaccctcc 
. tgattggctc 
gagtccatcc 
tcggccccgc 
ctgggggcgg 
gacttgagtt 
tggagggaag 
caaagtcgct 
cgcacccttc 
gcctcctggc 
tgatctgcaa 
ctaacctggt 
caagacttcc 
aaatgggagg 
ttgtgatccg 
ttatactctc 
tatgtaagca 
ctccttctgc 
tatt'atactg 
agaagtctga 
tagctggggc 
tttgccttga 
tgggcctatt 
agcttaactt 
attaaccatt 
tattgccagg 
attgattact 
ggaatgcaaa 
ttttctattt 
ttgctggtct 
tagccaaaca 
ttgttgtgtt 
tcggagttca 
catggtggct 
ccacaagtgt 
aagagaatac 
atatggtagc 
ttttatttga 
agaccaatgc 
aacttctagt 
tacaaagctg 
ttggatttca 
cctttaatcc 
ggtctacaca 
aggggggctc 
atgtggctaa 
tgttggttgg 
ctcttgtttg 
agctgctgag 
aattgtcctt 
atgaggagtg 



ctgaccgcac 

ccgcgccatt 

gcggaggctg 

gagcctcggc 

ggatgggcgg 

tacgagtcgt 

agcaaccgga 

cggggggggg 

ttttcctccc 

tcagttaacg 

gcgggaggta 

cgggggaggg 

ccttcctctg 

tcggggccca 

gccggccagc 

gccgcagagt 

ggacgggcag 

gcctcaagag 

gagcgagggc 

ctgagttgtt 

tccggagggg 

ttctgaggac 

ctccagtctt 

gtgtgggcgt 

cacagatttt 

ataggtagtc 

cctcggagta 

ctgcttgaga 

gaattttaat 

agccttatca 

gcttatccaa 

tgactcatga 

ctcaacactg 

tccttaattt 

ctcagtccag 

tccactccca 

aatggttcca 

ctgatatttt 

gcttactaaa 

gttcaggtgt 

aagcagtgct 

aaatgtgatt 

caagtaaaat 

gtgtgtgctt 

agagcacaga 

cacaaccatc 

"attcacatta 

tgtctccagt 

caataatcaa 

agtgtaactg 

agactttaat 

atcctatttg 

aaagctaagt 

aaaatgtagg 

cagcactctg 

tcaagttcta 

tgagatttca 

atccgtctac 

gtcttttgac 

ctactaactt 

atttgcagcc 

taattttttt 

aggtggacta
ggggattccc

ggcctctccg 
ccggggccgc 
taggtagggg 
ccgcggcagg 
gacgctggaa 

gggggaggga 

ggggcagcgg 

gccgtgtgtg 

gcagccggag 

ggtggggtga 

gagggagggt 

ggggagtcgt 

gaaaactggc 

gggggcggcg 

ctggccgcgc 

tagggctgag 

gggcgtgctg 

tcagttgggc. 

atcagtaagg 

ggaggggagt 

cgccctgggc 

tctagaagat 

tgtcctgcag 

cggttttgtc 

atctggggtt 

ttttccatcg 

tccttactac 

catttttaaa 

aaaggtattt 

cccctagaca 

aaccagacag 

cagttctttt 

tcagtgtcta 

ggagttttac 

tgaatgcctc 

ggtggatgtc 

aagacattaa 

attttgtcat 

ttgttgtctt 

ttctcttgga 

ttgccaagct 

gattaagcaa 

gtgctctata 

ctgctcttcc 

tgtaatggga 

aataaataaa 

agaatttact 

attactcttt 

tggacagagg 

gtcttttctc 

tttaaactgc 

ctgcagccat 

gccagagttt 

gaggcagaga 

tctaggatag 

taaaattata 

ctttctgatg 

actgtgggct 

cccatggctt 

tgatttccag 

tttaaaaaat 

atattaaatg 



agtgccggcg 

cccaccgccc 

ctaaagaaga 

atcgggactc 

coctccgagc 

ggggcaagcg 

gaagggagcg 

aggagcgctt 

aaaacacaaa 

tgcgcagccg 

ggcgagctgg 

cagcgaaagt 

tttacccgcc 

ccttgccatt 

aggaggcgct 

gcccctgcgc 

cggctgcggg 

agccagacct 

tgttttggag 

gagctgcagt 

gttgcaatac 

ctgggagaat 

gggcgggagt 

gggaattgaa 

gggaagtttt 

ttatgcagca 

aggtagatta 

agtatgaaat 

gagcccagta 

tagaacactc 

gagcattggc 

attagttaca 

ataactcctt 

tcacctctcc 

aacaatagat 

tctccttttt 

tcctccccat 

aaggtatatt 

tgtacacatc 

tcctgaccta 

ctggcttgac 

tcttcaggac 

caaatgtatt 

ataatactat 

agaagtcctg 

tctgatgccc 

tcctccttct 

.gaagtaatga 

aagcactgga 

agccataact 

ttacactaag 

tagctttact 

tactaaacat 

agccagccag 

caggcagatc 

ccaggaatac 

attgaagcat. 

agatttgggt 

ttctttaaag 

aaatggcatg 

ggtggggttg. 

gggttatata 

agtccctccc 



ccaggggcac 

cacacttatt 

ggctgtgctt 

tggcgggagg 

gtggtggagc 

ggtggtgggc 

gaaaagtctc 

ccggccgacg 

tggcgtgttt 

ccggcagcct 

acgtgcgggc 

agctcgcgcg 

gccggccggg 

ggctcgtgtt 

cccaggttcc 

aacgtggcag 

gcgggtgcaa 

ccatcgcgca 

gcaggaagca 

ggagtaggcg 

ctttctggga 

cccttccccc 

cttctgggca 

caggtgtaaa 

ttaatagggg 

aaactacagg 

aagacatgct 

tacagtgtcg 

cttcatatcc 

attttagccc 

attttccctt 

tacaccacaa 

agtacacttt 

cgtcagtggt 

gtattgagaa 

ctccatttat 

attacctgat 

tcattattga 

tgtaaaaggt 

aggtcttgtg 

tcatggcatt 

ctataatttt 

tgtgaagctt 

ccaggggctg 

agttcaattc 

tcttctggtg 

tcttcttttt 

aatactttgt 

aatgttacca 

gcagacttgt 

caataaagaa 

taacttttgt 

gaaagcaagt 

tggtggtgct 

tctgagtttg 

acacagaaac 

tccctaatga 

attatttttt 

cctccttcct 

gctttttgcc 

ggaaatcttt 

ataaacctca 

ctataaaaga
gctattaagg
aagggtctta 
tcaaatccag 
aatgttaatt 
tatgtatctc 
tcctgaatac 
ataggggaca 
aacttgaatt 
atgcttaagg 
' ggtgagagca 
gtctttaaac 
aagttttccc 
taaacaaatg 
agatacagaa 
gggtttccct 
ataaacttaa 
ttatatatgt 
aattttcaaa 
ataattgtgg 
ctgggtcaca 
atactagttt 
tgggctaagg 
gaattggagt 
ttatcaccct 
gacaacaata 
ttcgagacag 
ctggcctcga 
'gca'ccaccac 
cactgggtgg 
aaatctactg 
cggatatact 
ccagctttat 
ggttgatggt 
gagaattaaa 
ttcttaagga 
attgccatgg 
cctttagatc 
gagcagcaac 
tgttctcaga 
ttgctatgca 
attaaactct 
aagtattgta 
atttccccaa 
tgtagtttca 
atataaaact 
agagagttta 
attgtgtagt 
tgaacatttt 
actagaaatt 
gaaggaaagt 
ggcagatgaa 
aaattttccc 
gcttacaaat 
taaacctatg 
gaacagttat 
tgttcatttg 
gaaagactga 
gagggaggga 
gtaagttgga 
gtcatacaaa 
ttgatttcac 
tgggcctttt 
catcctgatc 



ctttttgtct 

gagttttagt 

agaatccaaa 

tgcttactgt 

tgcctgatct 

taaggttaaa 

cactaaggga 

attatcaccg 

gatccctagg 

tatttcaaaa 

catctgcata 

ccatcaaatt 

ttggttttgt 

caaagctgct 

gcactatcct 

gtttcctgtc 

taaacatttg 

acctgagcta 

tttgcaggca 

agtctaggag 

atatgttgga 

aattgattct 

cacttgtata 

gttttctaca 

gcatgttttc 

ggtttctctg 

actcagaaat 

gcctggctaa 

atttttaatt 

ttcactctaa 

agca'gattaa 

actaccttgt 

aagcttataa 

atttttgacc 

accttcccag 

ctcttaaagt 

attccttgag 

aggtcttcga 

agcaagcaga 

gtttggattt 

taagttacac 

gctttctttg 

agttccaaat 

tgcttttaaa 

gacatgtaga 

atatagcatg 

cattttgaaa 

aaatgcagac 

aaaaagctga 

gtaatagctt 

aagaaatact 

cacagatata 

gtggcttgac 

tcctaaatag' 

gttgccattt 

tggtgtgaac 

caagtttggg 

gggaggctgg 

ggaacttgta 

atgaacctag 

ctgactaata 

aacccagaaa 

tagtcctgaa 



tatacttaac 

atacagaaac 

ttcttcacag 

gaatataaaa 

gcaacaagat 

atgtagtaat 

gcttgggtga 

caacctactt 

acttgctgag 

aacaaggcag 

tacacaggga 

gggacatgga 

gatttgtgaa 

atttcgaggt 

gatcatgtgc 

aggtgatgtc 

tacttaatgt 

aataagtcat 

agactctctg 

tcaagcattt 

ggacatgttt 

gattagcact 

tctcaaataa 

taatttttgt 

aagccatatt 

tatagccctg 

ccgcctgcct 

gttggatatt 

cagtcagtag 

cagaggctgt 

gggtcaggga 

tctgatagaa 

ggaacttgaa 

aagtgttgtt 

attgccctca 

cttaattaag 

gacaggacag 

gatcaacatg 

ggcatgatgg 

ttctaatgta 

cttgtataat 

tatgtgagga 

tataaaacca 

atgcttaatt 

agtgtttgtc 

tcttttgcaa 

ggagtcattt 

ttgttcgtgt 

agtatttcag 

agaaaattta 

cagtggttct 

aactctaatc 

ttgtcactgt 

tcaaactgtg 

gccctaatgg 

caaagagggg 

tcttcacagt 

cttgttatgc 

cattgttagg 

accaacttct 

cttatccctt 

tttagtttca 

aataggacca 




ttttttttta 

tgttgcatcg 

ccaaagtcaa 

atgatagctt 

atgtagacta 

acttttggaa 

tagttggtaa 

tttaaaaaaa 

cacacaagag 

acaaccaaga 

tattaaaata 

tttctccggt 

attgttttca 

ctcttggttt 

taggcctacc 

atatgatttc 

gaaagttagg 

ttttacatgt 

acctagtaac 

caccttgaag 

atccagaaga 

gttagtgagc 

tgctggcctt 

tcaaagaaat 

ttttttcctt 

gctgtcctgg 

ctgcctcctg 

ttgttatata 

tcttaagtgg 

tggtactagt 

tagaaactag 

atatttcagg 

agtggagtaa 

gagccactga 

acactgcacc 

aatattaatt 

tgcttgttta 

atgttcataa 

tcagtgacag 

gtgtaggtag 

ccatgcaatg 

taaaggtgtt 

caacgttaga 

attcaattaa 

cagaacattt 

catactaact 

caatgagtgt 

tttagaaagc 

aagggaaata 

aaaccatata 

tttgagcaat 

tataactcat 

gcttgtttta 

gaatgacttt 

tgaaagaaaa 

ggcatgcact 

tggaattggg 

tgacaagtgt 

agtgtgacaa 

ggaagatgta 

atcaattaaa 

aactcctagg 

ccatcacccc
aatgtggtat

cttaatcaga 

attaagaatt 

ttcctgaggc 

aagttctgcc 

cttgcaggtc 

aatgtgtttc 

aaagccaggc 

tagttacttg 

aactacagtt 

ttccaaataa 

gaataggcag 

agtgatagtt 

atactcagaa 

ttaggctgat 

atatatcaag 

tctttgtggg 

cttacatttg 

cctacctata 

ttgagacgtt 

tattcaggac 

attgagtggc 

ttttaaaaag 

acttgtttgg 

tttttttttt 

aactcacttt 

agtgccggga 

actataacca 

tctttattgg 

ggcacttaag 

tctagcgttt 

acatctagag 

ctactccatt 

gaatggtctc 

acatttggtc 

gtgtaattat 

aggctatatt 

tcccaagatg 

taatgtcact 

aacatatgtg 

tgttatgcaa 

tgtcataaaa 

acttatttat 

caccgtttgt 

cttaaatgta 

tttgtgttgg 

cagattgttt 

aaaactgtca 

agctacttgc 

gttgtcattg 

ataacagctt 

aaatgttaca 

gttatgtgaa 

ttaatctatt 

agtggggagt 

tacacttcaa 

patccctttt 

gattaaattc 

tttggactct 

tataataact 

tacagaagat 

ttagtgttct 

caaaaaaatc 



ctttagaacc 6120 
ttttctagtt 6180 
tctgactttt- 6240 
agggtctcac 6300 
tgcttttgtc 6360 
agattctttt 6420 
aagtgatgaa 64 80 
ctgttagagc 6540 
gcaggctcct 6600 
aaggttacct 6660 
tatttcattc 6720 
agttggaaac 6780 
aaagcccatg 6840 
gcacttcttt 6900 
tgttgttcaa 6960 
gcaaaacatg 7020 
tttgattttt 7080 
gtggaattgt 7140 
gagcactttg 7200 
ttgttagtgt 7260 
tatttttgac 7320 
ctttaggctt 7380 
cccttgttct 7440 
atctcctttt 7500 
tttttggttt 7560 . 
gtagaccagg 7 620 
ttaaaggcgt 7680 
atactaactc 7740 
cccttcatta 7800 
caacttccta 7860 
tgtataccta 7920 * 
tgtactataa 7980' 
tctctgaggg 8040 
agaacataac 8100 
ctgcttgaac 8160 
tgtttttcct 8220 
tctgctgtct 8280 
ttgccattta 8340 
gtgttaaatg 8400 
ttctgtatga 8 4 60 
ttaccatttt 8520 
tgttttgaac 8580 
gaacaatggt 8640 
gttataatat 8700 
tactgtcttt 8760 
tgcgagcaat 8820 
tgaatgttat 8880 
gaagctttga 8940 
tgtattagtt 9000 
ctgaatatct 9060 
gttatattaa 9120 
aatggatgaa 9180 
agtttggcaa 9240 
ggtttgtcta 9300 
gccttggcac 9360 
acatcctttt 9420 
gtcagggagg 94 80 
aaactttgag 9540 
taatgatttg 9600 
ccatgttaca 9660 
gccagccatc 9720 
cactgagcta 9780 
tcaaataaga 9840 



WO 2004/035782 PCT/EP2003/011233



tttatgctag tgtttcaaaa ttttaggaat aggtaagatt agaaagtttt aaattttgag 9900
aaatggcttc tctagaaaga tgtacatagt gaacactgaa tggctcctaa agagcctaga 9960
aaactggtac tgagcacaca ggactgagag gtctttcttg aaaagcatgt attgctttac 10020
gtgggtcaca gaaggcaggc aggaagaact tgggctgaaa ctggtgtctt aagtggctaa 10080
catcttcaca actgatgagc aagaacttta tcctgatgca aaaaccatcc aaacaaacta 10140
agtgaaaggt ggcaatggat cccaggctgc tctagaggag gacttgactt ctcatcccat 10200
cacccacacc agatagctca tagactgcca attaacacca gcttctagcc tccacaggca 10260
cctgcactgg tacacataat ttcacacaaa cacagtaaga agccttccac ctggcatggt 10320
attgcttatc tttagttccc aacacttggg aggcagaggc cagccagggc tatgtgacaa 10380
aaaccttgtc tagaggagaa acttcatagc ttatttccta ttcacgtaac caggttagca 10440
aaatttacca gccagagatg aagctaacag tgtccactat atttgtagtg ttttaagtca 10500
attttttaaa tatacttaat agaattaaag ctatggtgaa ccaagtacaa agctggtgta 10560
ttaacttgag aacttagcat aaaaagtagt tcatttgttc agtaaatatt aaatgcttac 10620
tggcaaagat tatgtcagga acttggtaaa tggtgatgaa acaatcatag ttgtacatct 10680
tggttctgtg atcaccttgg tttgaggtaa aagtggttcc tttgatcaag gatggaattt 10740
taagtttata ttcaatcaat aatgtattat tttgtgattg caaaattgcc tatctagggt 10800
ataaaacctt taaaaatttc ataataccag ttcattctcc agttactaat tccaaaaagc 10860
cactgactat ggtgccaatg tggattctgt tctcaaagga aggattgtct gtgcccttta 10920
ttctaataga aacatcacac tgaaaatcta agctgaaaga agccagactt tcctaaataa 10980
ataactttcc ataaagctca aacaaggatt acttttagga ggcactgtta aggaactgat 11040
aagtaatgag gttacttata taatgatagt cccacaagac tatctgagga aaaatcagta 11100
caactcgaaa acagaacaac cagctaggca ggaataacag ggctcccaag tcaggaggtc 11160
tatccaacac ccttttctgt tgagggcccc agacctacat attgtataca aacagggagg 11220
tgggtgattt taactctcct gaggtacctt ggtaaatctt tgtcctgagt aagcagtaca 11280
gtgtacagtt tacattttca tttaaagata cattagctcc ctctaccccc taagactgac 11340
aggcactttg ggggtgggga gggctttgga aaataacgct tccatacact aaaagagaaa 11400
tttctttaat taggcttgtt ggttccatac atctactggt gtttctacta cttagtaata 11460
ttataatagt cacacaagca tctttgctct gtttaggttg tatatttatt ttaaggcaga 11520
tgataaaact gtagatctta agggatgctt ctgcttctga gatgatacaa agaatttaga 11580
ccataaaaca gta^gttgca caagcaatag aatatggcct aaagtgttct gacacttaga 11640
agccaagcag tgtaggcttc ttaagaaata ccattacaat caccttgcta gaaatcaagc 11700
attctggagt ggtcaagcag tgtaacctgt actgtaagtt acttttctgc tatttttctc 11760
ccaaagcaag ttctttatgc tgatatttcc agtgttagga actacaaata ttaataagtt 11820
gtcttcactc ttttctttac caaggagggt ctcttccttc atcttgatct gaaggatgaa 11880
caaaggcttg agcagtgcgc tttagaagat aaactgcagc atgaaggccc ccgatgttca 11940
cccagactac atggaccttt cgccacacat gtcccattcc agataaggcc tggcacacac 12000
aaaaaacata agtcattagg ctaccagtct gattctaaaa caacctaaaa tcttcccact 12060
taaatgctat gggtggtggg ttggaaagtt gactcagaaa atcacttgct gtttttagag 12120
aggatctggg ttcagtttct gatacattgt ggcttacaac tataactcca gttctagggg 12180
gtccatccaa catcctcttc tgttgagggc accaaataaa tgtattgtgt acaaacaggg 12240
aggtgagtga tttaactctc gtgtatagta ccttggtaaa acatttcttg tcctgagtaa 12300
gcagtacagc tctgcctgtc cctggtctac agacacggct catttcccga aggcaagctg 12360
gatagagatt ccaatttctc ttcttggatc ccatcctata aaagaaggtc aagtttaatc 12420
tattgcaaaa ggtaaatagg tagtttctta catgagacaa gaacaaatct taggtgtgaa 12480
gcagtcatct tttacaggcc agagcctcta ttctatgcca atgaaggaaa ctgttagtcc 12540
agtgttatag agttagtcca gtgtatagtt ttctatcaga acactttttt tttaaacaac 12600
tgcaacttag cttattgaag acaaaccacg agtagaaatc tgtccaagaa gcaagtgctt 12660
ctcagcctac aatgtggaat aggaccatgt aatggtacag tgagtgaaat gaattatggc 12720
atgtttttct gactgagaag acagtacaat aaaaggtaaa ctcatggtat ttatttaaaa 12780
agaatccaat ttctaccttt ttccaaatgg catatctgtt acaataatat ccacagaagc 12840
agttctcagt gggaggttgc agatatccca ctgaacagca tcaatgggca aaccccaggt 12900
tgtttttctg tggagacaaa ggtaagatat ttcaatatat tttcccaagc taatgagatg 12960
gctcagcaaa taatggtact ggccattaag tctcatgacc tgagcttgat cctcagggac 13020
catgtggtac aaggagagac ctaaatcctt cagttggact tcaatcttct accctcatgt 13080
ccacacacaa ataaatacaa taaaaaacat tctgcagtcg aatttctaaa agggcgaat 13139
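Each row of the sequence listing ends with the running residue count, and the last count must equal the length declared in the <211> field (13139 for SEQ ID NO: 1). The following Python sketch illustrates how rows in this layout can be parsed and checked against their printed positions. The function names are assumptions made for this illustration, and the sample consists of the last two rows of SEQ ID NO: 1 only.

```python
import re

# Last two rows of SEQ ID NO: 1 (residue groups followed by the end position).
SAMPLE = """\
catgtggtac aaggagagac ctaaatcctt cagttggact tcaatcttct accctcatgt 13080
ccacacacaa ataaatacaa taaaaaacat tctgcagtcg aatttctaaa agggcgaat 13139
"""


def parse_rows(text: str):
    """Return a list of (residues, printed_end_position) tuples."""
    rows = []
    for line in text.splitlines():
        tokens = line.split()
        # A data row is one or more residue groups followed by an integer.
        if len(tokens) < 2 or not tokens[-1].isdigit():
            continue
        residues = "".join(tokens[:-1])
        if not re.fullmatch(r"[acgt]+", residues):
            raise ValueError(f"unexpected characters in row: {line!r}")
        rows.append((residues, int(tokens[-1])))
    return rows


def totals_agree(rows, start: int = 0) -> bool:
    """Check that each printed position equals the running residue count."""
    count = start
    for residues, printed in rows:
        count += len(residues)
        if count != printed:
            return False
    return True


rows = parse_rows(SAMPLE)
# The sample starts after position 13020, the end of the preceding row.
print(totals_agree(rows, start=13020))  # True
```

A row that fails this check has gained or lost residues, which makes the test useful for locating transcription damage in a printed listing.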



<210> 2 
<211> 5409 
<212> DNA 

<213> Artificial Sequence 



<22b> 



. wo 2004/035782 ^ . ^r/EP2003/011233 

^9 - 5/39 • • 

. <223> ■ Description of Artificial Sequence: sequence of * 
homology region 

<400> 2 • . 

caggccctcc gagcgtggtg gagccgttct gtgagacagc cgggtacgag tcgtgacgct 60 
ggaaggggca agcgggtggt gggcaggaat gcggtccgcc ctgcagcaac cggaggggga 120 
gggagaaggg agcggaaaag tctccaccgg acgcggccat ggctcggggg ggggggggca 180 
gcggaggagc gcttccggcc gacgtctcgt cgctgattgg cttcttttcc tcccgccgtg 240 
. tgtgaaaaca caaatggcgt gttttggttg gcgtaaggcg cctgtcagtt aacggcagcc 300 
ggagtgcgca gccgccggca gcctcgctct gcccactggg tggggcggga ggtaggtggg 360 ' 
gtgaggcgag ctggacgtgc gggcgcggtc ggcctctggc ggggcggggg aggggaggga 420 
gggtcagcga aagtagctcg cgcgcgagcg gccgcccacc ctccccttcc tctgggggag 480 
tcgttttacc cgccgccggc cgggcctcgt cgtctgattg gctctcgggg cccagaaaac 540 
tggcccttgc cattggctcg tgttcgtgca agttgagtcc atccgccggc cagcgggggc 600 
ggcgaggagg cgctcccagg ttccggccct . cccctcggcc ccgcgccgca gagtctggcc 660 
gcgcgcccct gcgcaacgtg gcaggaagcg cgcgctgggg gcggggacgg gcagtagggc 720 
tgagcggctg cggggcgggt gcaagcacgt ttccgacttg agttgcctca agaggggcgt 780 
gctgagccag acctccatcg cgcactccgg ggagtggagg gaaggagcga gggctcagtt 840 
gggctgtttt ggaggcagga agcacttgct ctcccaaagt cgctctgagt tgttatcagt 900 
aagggagctg cagtggagta ggcggggaga aggccgcacc cttctccgga ggggggaggg 960 
gagtgttgca atacctttct gggagttctc tgctgcctcc tggcttctga ggaccgccct 1020 
gggcctggga gaatcccttc cccctcttcc ctcgtgatct gcaactccag tctttctaga 1080 
agatgggcgg gagtcttctg ggcaggctta aaggctaacc tggtgtgtgg gcgttgtcct 1140 
gcaggggaat tgaacaggtg taaaattgga gggacaagac ttcccacaga ttttcggttt 1200 
tgtcgggaag ttttttaata ggggcaaata aggaaaatgg gaggataggt agtcatctgg 1260 
ggttttatgc agcaaaacta caggttatta ttgcttgtga tccgcctcgg agtattttcc 1320 
atcgaggtag attaaagaca tgctcacccg agttttatac tctcctgctt gagatcctta 1380 
ctacagtatg aaattacagt gtcgcgagtt agactatgta agcagaattt taatcatttt 1440 
taaagagccc agtacttcat atccatttct cccgctcctt ctgcagcctt atcaaaaggt 1500 
attttagaac actcatttta gccccatttt catttattat actggcttat ccaaccccta 1560 
gacagagcat tggcattttc cctttcctga tcttagaagt ctgatgactc atgaaaccag 1620 
acagattagt tacatacacc acaaatcgag gctgtagctg gggcctcaac actgcagttc 1680 
ttttataact ccttagtaca ctttttgttg atcctttgcc ttgatcctta attttcagtg 1740 
tctatcacct ctcccgtcag tggtgttcca catttgggcc tattctcagt ccagggagtt 1800 
ttacaacaat agatgtattg agaatccaac ctaaagctta actttccact cccatgaatg 1860 
cctctctcct ttttctccat ttataaactg agctattaac cattaatggt tccaggtgga 1920 
tgtctcctcc ccatattacc tgatgtatct tacatattgc caggctgata ttttaagaca 1980 
ttaaaaggta tatttcatta ttgagccaca tggtattgat tactgcttac taaaattttg 2040 
tcattgtaca catctgtaaa aggtggttcc ttttggaatg caaagttcag gtgtttgttg 2100 
tctttcctga cctaaggtct tgtgagcttg tattttttct atttaagcag tgctttctct 2160 
tggactggct tgactcatgg cattctacac gttattgctg gtctaaatgt gattttgcca 2220 
agcttcttca ggacctataa ttttgcttga cttgtagcca aacacaagta aaatgattaa 2280 
gcaacaaatg tatttgtgaa gcttggtttt taggttgttg tgttgtgtgt gcttgtgctc 2340 
tataataata ctatccaggg gctggagagg tggctcggag ttcaagagca cagactgctc 2400 
ttccagaagt cctgagttca attcccagca accacatggt ggctcacaac catctgtaat 2460 
gggatctgat gccctcttct ggtgtgtctg aagaccacaa gtgtattcac attaaataaa 2520 
taaatcctcc ttcttcttct tttttttttt tttaaagaga atactgtctc cagtagaatt 2580 
tactgaagta atgaaatact ttgtgtttgt tccaatatgg tagccaataa tcaaattact 2640 
ctttaagcac tggaaatgtt accaaggaac taatttttat ttgaagtgta actgtggaca 2700 
gaggagccat aactgcagac ttgtgggata cagaagacca atgcagactt taatgtcttt 2760 
tctcttacac taagcaataa agaaataaaa attgaacttc tagtatccta tttgtttaaa 2820 
ctgctagctt tacttaactt ttgtgcttca tctatacaaa gctgaaagct aagtctgcag 2880 
ccattactaa acatgaaagc aagtaatgat aattttggat ttcaaaaatg tagggccaga 2940 
gtttagccag ccagtggtgg tgcttgcctt tatgccttta atcccagcac tctggaggca 3000 
gagacaggca gatctctgag tttgagccca gcctggtcta cacatcaagt tctatctagg 3060 
atagccagga atacacacag aaaccctgtt ggggaggggg gctctgagat ttcataaaat 3120 
tataattgaa gcattcccta atgagccact atggatgtgg ctaaatccgt ctacctttct 3180 
gatgagattt gggtattatt ttttctgtct ctgctgttgg ttgggtcttt tgacactgtg 3240 
ggctttcttt aaagcctcct tcctgccatg tggtctcttg tttgctacta acttcccatg 3300 
gcttaaatgg catggctttt tgccttctaa gggcagctgc tgagatttgc agcctgattt 3360 
ccagggtggg gttgggaaat ctttcaaaca ctaaaattgt cctttaattt tttttttaaa 3420 
aaatgggtta tataataaac ctcataaaat agttatgagg agtgaggtgg actaatatta 3480 
aatgagtccc tcccctataa aagagctatt aaggcttttt gtcttatact taactttttt 3540 
tttaaatgtg gtatctttag aaccaagggt cttagagttt tagtatacag aaactgttgc 3600 
atcgcttaat cagattttct agtttcaaat ccagagaatc caaattcttc acagccaaag 3660 
tcaaattaag aatttctgac ttttaatgtt aatttgctta ctgtgaatat aaaaatgata 3720 
gcttttcctg aggcagggtc tcactatgta tctctgcctg atctgcaaca agatatgtag 3780 
actaaagttc tgcctgcttt tgtctcctga atactaaggt taaaatgtag taatactttt 3840 
ggaacttgca ggtcagattc ttttataggg gacacactaa gggagcttgg gtgatagttg 3900 
gtaaaatgtg tttcaagtga tgaaaacttg aattattatc accgcaacct actttttaaa 3960 
aaaaaaagcc aggcctgtta gagcatgctt aagggatccc taggacttgc tgagcacaca 4020 
agagtagtta cttggcaggc tcctggtgag agcatatttc aaaaaacaag gcagacaacc 4080 
aagaaactac agttaaggtt acctgtcttt aaaccatctg catatacaca gggatattaa 4140 
aatattccaa ataatatttc attcaagttt tcccccatca aattgggaca tggatttctc 4200 
cggtgaatag gcagagttgg aaactaaaca aatgttggtt ttgtgatttg tgaaattgtt 4260 
ttcaagtgat agttaaagcc catgagatac agaacaaagc tgctatttcg aggtctcttg 4320 
gtttatactc agaagcactt ctttgggttt ccctgcacta tcctgatcat gtgctaggcc 4380 
taccttaggc tgattgttgt tcaaataaac ttaagtttcc tgtcaggtga tgtcatatga 4440 
tttcatatat caaggcaaaa catgttatat atgttaaaca tttgtactta atgtgaaagt 4500 
taggtctttg tgggtttgat ttttaatttt caaaacctga gctaaataag tcatttttac 4560 
atgtcttaca tttggtggaa ttgtataatt gtggtttgca ggcaagactc tctgacctag 4620 
taaccctacc tatagagcac tttgctgggt cacaagtcta ggagtcaagc atttcacctt 4680 
gaagttgaga cgttttgtta gtgtatacta gtttatatgt tggaggacat gtttatccag 4740 
aagatattca ggactatttt tgactgggct aaggaattga ttctgattag cactgttagt 4800 
gagcattgag tggcctttag gcttgaattg gagtcacttg tatatctcaa ataatgctgg 4860 
ccttttttaa aaagcccttg ttctttatca ccctgttttc tacataattt ttgttcaaag 4920 
aaatacttgt ttggatctcc ttttgacaac aatagcatgt tttcaagcca tatttttttt 4980 
cctttttttt tttttttttg gtttttcgag acagggtttc tctgtatagc cctggctgtc 5040 
ctggaactca ctttgtagac caggctggcc tcgaactcag aaatccgcct gcctctgcct 5100 
cctgagtgcc gggattaaag gcgtgcacca ccacgcctgg ctaagttgga tattttgtta 5160 
tataactata accaatacta actccactgg gtggattttt aattcagtca gtagtcttaa 5220 
gtggtcttta ttggcccttc attaaaatct actgttcact ctaacagagg ctgttggtac 5280 
tagtggcact taagcaactt cctacggata tactagcaga ttaagggtca gggatagaaa 5340 
ctagtctagc gttttgtata cctaccagct ttatactacc ttgttctgat agaaatattt 5400 
caggacatc 5409 



<210> 3 
<211> 4413 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequence of 
Fluc-hygro insert 



<400> 3 

tctaggtaac cgatatccct gcaggggtga cctgcacgtc tagggcgcag tagtccaggg 60 
tttccttgat gatgtcatac ttatcctgtc cctttttttt ccacagctcg cggttgagga 120 
caaactcttc gcggtctttc cagtactcct gcaggtgact gactgagtcg agatctgcga 180 
tctaagtaag cttggcattc cggtactgtt ggtaaagcca ccatggaaga cgccaaaaac 240 
ataaagaaag gcccggcgcc attctatccg ctggaagatg gaaccgctgg agagcaactg 300 
cataaggcta tgaagagata cgccctggtt cctggaacaa ttgcttttac agatgcacat 360 
atcgaggtgg acatcactta cgctgagtac ttcgaaatgt ccgttcggtt ggcagaagct 420 
atgaaacgat atgggctgaa tacaaatcac agaatcgtcg tatgcagtga aaactctctt 480 
caattcttta tgccggtgtt gggcgcgtta tttatcggag ttgcagttgc gcccgcgaac 540 
gacatttata atgaacgtga attgctcaac agtatgggca tttcgcagcc taccgtggtg 600 
ttcgtttcca aaaaggggtt gcaaaaaatt ttgaacgtgc aaaaaaagct cccaatcatc 660 
caaaaaatta ttatcatgga ttctaaaacg gattaccagg gatttcagtc gatgtacacg 720 
ttcgtcacat ctcatctacc tcccggtttt aatgaatacg attttgtgcc agagtccttc 780 
gatagggaca agacaattgc actgatcatg aactcctctg gatctactgg tctgcctaaa 840 
ggtgtcgctc tgcctcatag aactgcctgc gtgagattct cgcatgccag agatcctatt 900 
tttggcaatc aaatcattcc ggatactgcg attttaagtg ttgttccatt ccatcacggt 960 
tttggaatgt ttactacact cggatatttg atatgtggat ttcgagtcgt cttaatgtat 1020 
agatttgaag aagagctgtt tctgaggagc cttcaggatt acaagattca aagtgcgctg 1080 
ctggtgccaa ccctattctc cttcttcgcc aaaagcactc tgattgacaa atacgattta 1140 
tctaatttac acgaaattgc ttctggtggc gctcccctct ctaaggaagt cggggaagcg 1200 
gttgccaaga ggttccatct gccaggtatc aggcaaggat atgggctcac tgagactaca 1260 
tcagctattc tgattacacc cgagggggat gataaaccgg gcgcggtcgg taaagttgtt 1320 
ccattttttg aagcgaaggt tgtggatctg gataccggga aaacgctggg cgttaatcaa 1380 
agaggcgaac tgtgtgtgag aggtcctatg attatgtccg gttatgtaaa caatccggaa 1440 
gcgaccaacg ccttgattga caaggatgga tggctacatt ctggagacat agcttactgg 1500 
gacgaagacg aacacttctt catcgttgac cgcctgaagt ctctgattaa gtacaaaggc 1560 
tatcaggtgg ctcccgctga attggaatcc atcttgctcc aacaccccaa catcttcgac 1620 
gcaggtgtcg caggtcttcc cgacgatgac gccggtgaac ttcccgccgc cgttgttgtt 1680 
ttggagcacg gaaagacgat gacggaaaaa gagatcgtgg attacgtcgc cagtcaagta 1740 
acaaccgcga aaaagttgcg cggaggagtt gtgtttgtgg acgaagtacc gaaaggtctt 1800 
accggaaaac tcgacgcaag aaaaatcaga gagatcctca taaaggccaa gaagggcgga 1860 
aagatcgccg tgtaattcta gaccggttcg agatccaggc gcggatcaat aaaagatcat 1920 
tattttcaat agatctgtgt gttggttttt tgtgtgcctt gggggagggg gaggccagaa 1980 
tgaggcgcgg ccaaggggga gggggaggcc agaatgacct tgggggaggg ggaggccaga 2040 
atgaccttgg gggaggggga ggccagaatg aggcgcgccc ccgatccgtc gacgccctaa 2100 
ggccatagcg gccgccctga ggccgcgggc gatcgcctag gggtaaccga agttcctata 2160 
ctttctagag aataggaact tcggaatagg aacttcttat aatctagaag atctggatcc 2220 
acgattcgag ggcccctgca ggtcaattct accgggtagg ggaggcgctt ttcccaaggc 2280 
agtctggagc atgcgcttta gcagccccgc tggcacttgg cgctacacaa gtggcctctg 2340 
gcctcgcaca cattccacat ccaccggtag cgccaaccgg ctccgttctt tggtggcccc 2400 
ttcgcgccac cttctactcc tcccctagtc aggaagttcc cccccgcccc gcagctcgcg 2460 
tcgtgcagga cgtgacaaat ggaagtagca cgtctcacta gtctcgtgca gatggacagc 2520 
accgctgagc aatggaagcg ggtaggcctt tggggcagcg gccaatagca gctttgctcc 2580 
ttcgctttct gggctcagag gctgggaagg ggtgggtccg ggggcgggct caggggcggg 2640 
ctcaggggcg gggcgggcgc gaaggtcctc ccgaggcccg gcattctcgc acgcttcaaa 2700 
agcgcacgtc tgccgcgctg ttctcctctt cctcatctcc gggcctttcg acgatccagc 2760 
cgccaccatg aaaaagcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa 2820 
gttcgacagc gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag 2880 
cttcgatgta ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta 2940 
caaagatcgt tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct 3000 
tgacattggg gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt 3060 
cacgttgcaa gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc 3120 
catggatgcc atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc 3180 
gcaaggaatc ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca 3240 
tgtgtatcac tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct 3300 
cgatgagctg atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga 3360 
tttcggctcc aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag 3420 
cgaggcgatg ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg 3480 
gttggcttgt atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg 3540 
atcgccgcgg ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt 3600 
ggttgacggc aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg 3660 
atccggagcc gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac 3720 
cgatggctgt gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag 3780 
ggcaaaggaa tagtcgatgc agaaattgat gatctattaa acaataaaga tgtccactaa 3840 
aatggaagtt tttcctgtca tactttgtta agaagggtga gaacagagta cctacatttt 3900 
gaatggaagg attggagcta cgggggtggg ggtggggtgg gattagataa atgcctgctc 3960 
tttactgaag gctctttact attgctttat gataatgttt catagttgga tatcataatt 4020 
taaacaagca aaaccaaatt aagggccagc tcattcctcc cactcatgat ctatagatct 4080 
atagatctct cgtgggatca ttgtttttct cttgattccc actttgtggt tctaagtact 4140 
gtggtttcca aatgtgtcag tttcatagcc tgaagaacga gatcagcagc ctctgttcca 4200 
catacacttc attctcagta ttgttttgcc aagttctaat tccatcagaa gctgactcta 4260 
gatcctgcag gaattcatat gaagttccta tactttctag agaataggaa cttcggaata 4320 
ggaacttcaa aatgtcgcgg cgcgccggta accgaagttc ctatactttc tagagaatag 4380 
gaacttcgga ataggaactt caagcttaag cgc 4413 



<210> 4 
<213> Artificial Sequence 
<223> Description of Artificial Sequence: Targeting 
vector for Rosa26 locus with a Fluc-hygro insert 

<400> 4 
ctagggataa cagggtaata tagccgcggc aggccctccg agcgtggtgg agccgttctg 60 
tgagacagcc gggtacgagt cgtgacgctg gaaggggcaa gcgggtggtg ggcaggaatg 120 
cggtccgccc tgcagcaacc ggagggggag ggagaaggga gcggaaaagt ctccaccgga 180 
cgcggccatg gctcgggggg gggggggcag cggaggagcg cttccggccg acgtctcgtc 240 
gctgattggc ttcttttcct cccgccgtgt gtgaaaacac aaatggcgtg ttttggttgg 300 
cgtaaggcgc ctgtcagtta acggcagccg gagtgcgcag ccgccggcag cctcgctctg 360 
cccactgggt ggggcgggag gtaggtgggg tgaggcgagc tggacgtgcg ggcgcggtcg 420 
gcctctggcg gggcggggga ggggagggag ggtcagcgaa agtagctcgc gcgcgagcgg 480 
ccgcccaccc tccccttcct ctgggggagt cgttttaccc gccgccggcc gggcctcgtc 540 
gtctgattgg ctctcggggc ccagaaaact ggcccttgcc attggctcgt gttcgtgcaa 600 
gttgagtcca tccgccggcc agcgggggcg gcgaggaggc gctcccaggt tccggccctc 660 
ccctcggccc cgcgccgcag agtctggccg cgcgcccctg cgcaacgtgg caggaagcgc 720 
gcgctggggg cggggacggg cagtagggct gagcggctgc ggggcgggtg caagcacgtt 780 
tccgacttga gttgcctcaa gaggggcgtg ctgagccaga cctccatcgc gcactccggg 840 
gagtggaggg aaggagcgag ggctcagttg ggctgttttg gaggcaggaa gcacttgctc 900 
tcccaaagtc gctctgagtt gttatcagta agggagctgc agtggagtag gcggggagaa 960 
ggccgcaccc ttctccggag gggggagggg agtgttgcaa tacctttctg ggagttctct 1020 
gctgcctcct ggcttctgag gaccgccctg ggcctgggag aatcccttcc ccctcttccc 1080 
tcgtgatctg caactccagt ctttctaggt aaccgatatc cctgcagggg tgacctgcac 1140 
gtctagggcg cagtagtcca gggtttcctt gatgatgtca tacttatcct gtcccttttt 1200 
tttccacagc tcgcggttga ggacaaactc ttcgcggtct ttccagtact cctgcaggtg 1260 
actgactgag tcgagatctg cgatctaagt aagcttggca ttccggtact gttggtaaag 1320 
ccaccatgga agacgccaaa aacataaaga aaggcccggc gccattctat ccgctggaag 1380 
atggaaccgc tggagagcaa ctgcataagg ctatgaagag atacgccctg gttcctggaa 1440 
caattgcttt tacagatgca catatcgagg tggacatcac ttacgctgag tacttcgaaa 1500 
tgtccgttcg gttggcagaa gctatgaaac gatatgggct gaatacaaat cacagaatcg 1560 
tcgtatgcag tgaaaactct cttcaattct ttatgccggt gttgggcgcg ttatttatcg 1620 
gagttgcagt tgcgcccgcg aacgacattt ataatgaacg tgaattgctc aacagtatgg 1680 
gcatttcgca gcctaccgtg gtgttcgttt ccaaaaaggg gttgcaaaaa attttgaacg 1740 
tgcaaaaaaa gctcccaatc atccaaaaaa ttattatcat ggattctaaa acggattacc 1800 
agggatttca gtcgatgtac acgttcgtca catctcatct acctcccggt tttaatgaat 1860 
acgattttgt gccagagtcc ttcgataggg acaagacaat tgcactgatc atgaactcct 1920 
ctggatctac tggtctgcct aaaggtgtcg ctctgcctca tagaactgcc tgcgtgagat 1980 
tctcgcatgc cagagatcct atttttggca atcaaatcat tccggatact gcgattttaa 2040 
gtgttgttcc attccatcac ggttttggaa tgtttactac actcggatat ttgatatgtg 2100 
gatttcgagt cgtcttaatg tatagatttg aagaagagct gtttctgagg agccttcagg 2160 
attacaagat tcaaagtgcg ctgctggtgc caaccctatt ctccttcttc gccaaaagca 2220 
ctctgattga caaatacgat ttatctaatt tacacgaaat tgcttctggt ggcgctcccc 2280 
tctctaagga agtcggggaa gcggttgcca agaggttcca tctgccaggt atcaggcaag 2340 
gatatgggct cactgagact acatcagcta ttctgattac acccgagggg gatgataaac 2400 
cgggcgcggt cggtaaagtt gttccatttt ttgaagcgaa ggttgtggat ctggataccg 2460 
ggaaaacgct gggcgttaat caaagaggcg aactgtgtgt gagaggtcct atgattatgt 2520 
ccggttatgt aaacaatccg gaagcgacca acgccttgat tgacaaggat ggatggctac 2580 
attctggaga catagcttac tgggacgaag acgaacactt cttcatcgtt gaccgcctga 2640 
agtctctgat taagtacaaa ggctatcagg tggctcccgc tgaattggaa tccatcttgc 2700 
tccaacaccc caacatcttc gacgcaggtg tcgcaggtct tcccgacgat gacgccggtg 2760 
aacttcccgc cgccgttgtt gttttggagc acggaaagac gatgacggaa aaagagatcg 2820 
tggattacgt cgccagtcaa gtaacaaccg cgaaaaagtt gcgcggagga gttgtgtttg 2880 
tggacgaagt accgaaaggt cttaccggaa aactcgacgc aagaaaaatc agagagatcc 2940 
tcataaaggc caagaagggc ggaaagatcg ccgtgtaatt ctagaccggt tcgagatcca 3000 
ggcgcggatc aataaaagat cattattttc aatagatctg tgtgttggtt ttttgtgtgc 3060 
cttgggggag ggggaggcca gaatgaggcg cggccaaggg ggagggggag gccagaatga 3120 
ccttggggga gggggaggcc agaatgacct tgggggaggg ggaggccaga atgaggcgcg 3180 
cccccgatcc gtcgacgccc taaggccata gcggccgccc tgaggccgcg ggcgatcgcc 3240 
taggggtaac cgaagttcct atactttcta gagaatagga acttcggaat aggaacttct 3300 
tataatctag aagatctgga tccacgattc gagggcccct gcaggtcaat tctaccgggt 3360 
aggggaggcg cttttcccaa ggcagtctgg agcatgcgct ttagcagccc cgctggcact 3420 
tggcgctaca caagtggcct ctggcctcgc acacattcca catccaccgg tagcgccaac 3480 
cggctccgtt ctttggtggc cccttcgcgc caccttctac tcctccccta gtcaggaagt 3540 
tcccccccgc cccgcagctc gcgtcgtgca ggacgtgaca aatggaagta gcacgtctca 3600 
ctagtctcgt gcagatggac agcaccgctg agcaatggaa gcgggtaggc ctttggggca 3660 
gcggccaata gcagctttgc tccttcgctt tctgggctca gaggctggga aggggtgggt 3720 
ccgggggcgg gctcaggggc gggctcaggg gcggggcggg cgcgaaggtc ctcccgaggc 3780 
ccggcattct cgcacgcttc aaaagcgcac gtctgccgcg ctgttctcct cttcctcatc 3840 
tccgggcctt tcgacgatcc agccgccacc atgaaaaagc ctgaactcac cgcgacgtct 3900 
gtcgagaagt ttctgatcga aaagttcgac agcgtctccg acctgatgca gctctcggag 3960 
ggcgaagaat ctcgtgcttt cagcttcgat gtaggagggc gtggatatgt cctgcgggta 4020 
aatagctgcg ccgatggttt ctacaaagat cgttatgttt atcggcactt tgcatcggcc 4080 
gcgctcccga ttccggaagt gcttgacatt ggggaattca gcgagagcct gacctattgc 4140 
atctcccgcc gtgcacaggg tgtcacgttg caagacctgc ctgaaaccga actgcccgct 4200 
gttctgcagc cggtcgcgga ggccatggat gccatcgctg cggccgatct tagccagacg 4260 
agcgggttcg gcccattcgg accgcaagga atcggtcaat acactacatg gcgtgatttc 4320 
atatgcgcga ttgctgatcc ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc 4380 
agtgcgtccg tcgcgcaggc tctcgatgag ctgatgcttt gggccgagga ctgccccgaa 4440 
gtccggcacc tcgtgcacgc ggatttcggc tccaacaatg tcctgacgga caatggccgc 4500 
ataacagcgg tcattgactg gagcgaggcg atgttcgggg attcccaata cgaggtcgcc 4560 
aacatcttct tctggaggcc gtggttggct tgtatggagc agcagacgcg ctacttcgag 4620 
cggaggcatc cggagcttgc aggatcgccg cggctccggg cgtatatgct ccgcattggt 4680 
cttgaccaac tctatcagag cttggttgac ggcaatttcg atgatgcagc ttgggcgcag 4740 
ggtcgatgcg acgcaatcgt ccgatccgga gccgggactg tcgggcgtac acaaatcgcc 4800 
cgcagaagcg cggccgtctg gaccgatggc tgtgtagaag tactcgccga tagtggaaac 4860 
cgacgcccca gcactcgtcc gagggcaaag gaatagtcga tgcagaaatt gatgatctat 4920 
taaacaataa agatgtccac taaaatggaa gtttttcctg tcatactttg ttaagaaggg 4980 
tgagaacaga gtacctacat tttgaatgga aggattggag ctacgggggt gggggtgggg 5040 
tgggattaga taaatgcctg ctctttactg aaggctcttt actattgctt tatgataatg 5100 
tttcatagtt ggatatcata atttaaacaa gcaaaaccaa attaagggcc agctcattcc 5160 
tcccactcat gatctataga tctatagatc tctcgtggga tcattgtttt tctcttgatt 5220 
cccactttgt ggttctaagt actgtggttt ccaaatgtgt cagtttcata gcctgaagaa 5280 
cgagatcagc agcctctgtt ccacatacac ttcattctca gtattgtttt gccaagttct 5340 
aattccatca gaagctgact ctagatcctg caggaattca tatgaagttc ctatactttc 5400 
tagagaatag gaacttcgga ataggaactt caaaatgtcg cggcgcgccg gtaaccgaag 5460 
ttcctatact ttctagagaa taggaacttc ggaataggaa cttcaagctt aagcgctaga 5520 
agatgggcgg gagtcttctg ggcaggctta aaggctaacc tggtgtgtgg gcgttgtcct 5580 
gcaggggaat tgaacaggtg taaaattgga gggacaagac ttcccacaga ttttcggttt 5640 
tgtcgggaag ttttttaata ggggcaaata aggaaaatgg gaggataggt agtcatctgg 5700 
ggttttatgc agcaaaacta caggttatta ttgcttgtga tccgcctcgg agtattttcc 5760 
atcgaggtag attaaagaca tgctcacccg agttttatac tctcctgctt gagatcctta 5820 
ctacagtatg aaattacagt gtcgcgagtt agactatgta agcagaattt taatcatttt 5880 
taaagagccc agtacttcat atccatttct cccgctcctt ctgcagcctt atcaaaaggt 5940 
attttagaac actcatttta gccccatttt catttattat actggcttat ccaaccccta 6000 
gacagagcat tggcattttc cctttcctga tcttagaagt ctgatgactc atgaaaccag 6060 
acagattagt tacatacacc acaaatcgag gctgtagctg gggcctcaac actgcagttc 6120 
ttttataact ccttagtaca ctttttgttg atcctttgcc ttgatcctta attttcagtg 6180 
tctatcacct ctcccgtcag tggtgttcca catttgggcc tattctcagt ccagggagtt 6240 
ttacaacaat agatgtattg agaatccaac ctaaagctta actttccact cccatgaatg 6300 
cctctctcct ttttctccat ttataaactg agctattaac cattaatggt tccaggtgga 6360 
tgtctcctcc ccatattacc tgatgtatct tacatattgc caggctgata ttttaagaca 6420 
ttaaaaggta tatttcatta ttgagccaca tggtattgat tactgcttac taaaattttg 6480 
tcattgtaca catctgtaaa aggtggttcc ttttggaatg caaagttcag gtgtttgttg 6540 
tctttcctga cctaaggtct tgtgagcttg tattttttct atttaagcag tgctttctct 6600 
tggactggct tgactcatgg cattctacac gttattgctg gtctaaatgt gattttgcca 6660 
agcttcttca ggacctataa ttttgcttga cttgtagcca aacacaagta aaatgattaa 6720 
gcaacaaatg tatttgtgaa gcttggtttt taggttgttg tgttgtgtgt gcttgtgctc 6780 
tataataata ctatccaggg gctggagagg tggctcggag ttcaagagca cagactgctc 6840 
ttccagaagt cctgagttca attcccagca accacatggt ggctcacaac catctgtaat 6900 
gggatctgat gccctcttct ggtgtgtctg aagaccacaa gtgtattcac attaaataaa 6960 
taaatcctcc ttcttcttct tttttttttt tttaaagaga atactgtctc cagtagaatt 7020 
tactgaagta atgaaatact ttgtgtttgt tccaatatgg tagccaataa tcaaattact 7080 
ctttaagcac tggaaatgtt accaaggaac taatttttat ttgaagtgta actgtggaca 7140 
gaggagccat aactgcagac ttgtgggata cagaagacca atgcagactt taatgtcttt 7200 
tctcttacac taagcaataa agaaataaaa attgaacttc tagtatccta tttgtttaaa 7260 
ctgctagctt tacttaactt ttgtgcttca tctatacaaa gctgaaagct aagtctgcag 7320 
ccattactaa acatgaaagc aagtaatgat aattttggat ttcaaaaatg tagggccaga 7380 
gtttagccag ccagtggtgg tgcttgcctt tatgccttta atcccagcac tctggaggca 7440 
gagacaggca gatctctgag tttgagccca gcctggtcta cacatcaagt tctatctagg 7500 
atagccagga atacacacag aaaccctgtt ggggaggggg gctctgagat ttcataaaat 7560 
tataattgaa gcattcccta atgagccact atggatgtgg ctaaatccgt ctacctttct 7620 
gatgagattt gggtattatt ttttctgtct ctgctgttgg ttgggtcttt tgacactgtg 7680 
ggctttcttt aaagcctcct tcctgccatg tggtctcttg tttgctacta acttcccatg 7740 
gcttaaatgg catggctttt tgccttctaa gggcagctgc tgagatttgc agcctgattt 7800 
ccagggtggg gttgggaaat ctttcaaaca ctaaaattgt cctttaattt tttttttaaa 7860 
aaatgggtta tataataaac ctcataaaat agttatgagg agtgaggtgg actaatatta 7920 
aatgagtccc tcccctataa aagagctatt aaggcttttt gtcttatact taactttttt 7980 
tttaaatgtg gtatctttag aaccaagggt cttagagttt tagtatacag aaactgttgc 8040 
atcgcttaat cagattttct agtttcaaat ccagagaatc caaattcttc acagccaaag 8100 
tcaaattaag aatttctgac ttttaatgtt aatttgctta ctgtgaatat aaaaatgata 8160 
gcttttcctg aggcagggtc tcactatgta tctctgcctg atctgcaaca agatatgtag 8220 
actaaagttc tgcctgcttt tgtctcctga atactaaggt taaaatgtag taatactttt 8280 
ggaacttgca ggtcagattc ttttataggg gacacactaa gggagcttgg gtgatagttg 8340 
gtaaaatgtg tttcaagtga tgaaaacttg aattattatc accgcaacct actttttaaa 8400 
aaaaaaagcc aggcctgtta gagcatgctt aagggatccc taggacttgc tgagcacaca 8460 
agagtagtta cttggcaggc tcctggtgag agcatatttc aaaaaacaag gcagacaacc 8520 
aagaaactac agttaaggtt acctgtcttt aaaccatctg catatacaca gggatattaa 8580 
aatattccaa ataatatttc attcaagttt tcccccatca aattgggaca tggatttctc 8640 
cggtgaatag gcagagttgg aaactaaaca aatgttggtt ttgtgatttg tgaaattgtt 8700 
ttcaagtgat agttaaagcc catgagatac agaacaaagc tgctatttcg aggtctcttg 8760 
gtttatactc agaagcactt ctttgggttt ccctgcacta tcctgatcat gtgctaggcc 8820 
taccttaggc tgattgttgt tcaaataaac ttaagtttcc tgtcaggtga tgtcatatga 8880 
tttcatatat caaggcaaaa catgttatat atgttaaaca tttgtactta atgtgaaagt 8940 
taggtctttg tgggtttgat ttttaatttt caaaacctga gctaaataag tcatttttac 9000 
atgtcttaca tttggtggaa ttgtataatt gtggtttgca ggcaagactc tctgacctag 9060 
taaccctacc tatagagcac tttgctgggt cacaagtcta ggagtcaagc atttcacctt 9120 
gaagttgaga cgttttgtta gtgtatacta gtttatatgt tggaggacat gtttatccag 9180 
aagatattca ggactatttt tgactgggct aaggaattga ttctgattag cactgttagt 9240 
gagcattgag tggcctttag gcttgaattg gagtcacttg tatatctcaa ataatgctgg 9300 
ccttttttaa aaagcccttg ttctttatca ccctgttttc tacataattt ttgttcaaag 9360 
aaatacttgt ttggatctcc ttttgacaac aatagcatgt tttcaagcca tatttttttt 9420 
cctttttttt tttttttttg gtttttcgag acagggtttc tctgtatagc cctggctgtc 9480 
ctggaactca ctttgtagac caggctggcc tcgaactcag aaatccgcct gcctctgcct 9540 
cctgagtgcc gggattaaag gcgtgcacca ccacgcctgg ctaagttgga tattttgtta 9600 
tataactata accaatacta actccactgg gtggattttt aattcagtca gtagtcttaa 9660 
gtggtcttta ttggcccttc attaaaatct actgttcact ctaacagagg ctgttggtac 9720 
tagtggcact taagcaactt cctacggata tactagcaga ttaagggtca gggatagaaa 9780 
ctagtctagc gttttgtata cctaccagct ttatactacc ttgttctgat agaaatattt 9840 
caggacatct agcacgtgtt aactcgagct gcaggattcg agggccccgg caggtcaatt 9900 
ctaccgggta ggggaggcgc ttttcccaag gcagtctgga gcatgcgctt tagcagcccc 9960 
gctgggcact tggcgctaca caagtggcct ctggcctcgc acacattcca catccaccgg 10020 
taggcgccaa ccggctccgt tctttggtgg ccccttcgcg ccaccttcta ctcctcccct 10080 
agtcaggaag ttcccccccg ccccgcagct cgcgtcgtgc aggacgtgac aaatggaagt 10140 
agcacgtctc actagtctcg tgcagatgga cagcaccgct gagcaatgga agcgggtagg 10200 
cctttggggc agcggccaat agcagctttg ctccttcgct ttctgggctc agaggctggg 10260 
aaggggtggg tccgggggcg ggctcagggg cgggctcagg ggcggggcgg gcgcccgaag 10320 
gtcctccgga ggcccggcat tctgcacgct tcaaaagcgc acgtctgccg cgctgttctc 10380 
ctcttcctca tctccgggcc tttcgacctg cagccaatgc accgtccttg ccatcatggc 10440 
ctcgtacccc ggccatcaac acgcgtctgc gttcgaccag gctgcgcgtt ctcgcggcca 10500 
tagcaaccga cgtacggcgt tgcgccctcg ccggcagcaa gaagccacgg aagtccgccc 10560 
ggagcagaaa atgcccacgc tactgcgggt ttatatagac ggtccccacg ggatggggaa 10620 
aaccaccacc acgcaactgc tggtggccct gggttcgcgc gacgatatcg tctacgtacc 10680 
cgagccgatg acttactggc gggtgctggg ggcttccgag acaatcgcga acatctacac 10740 
cacacaacac cgcctcgacc agggtgagat atcggccggg gacgcggcgg tggtaatgac 10800 
aagcgcccag ataacaatgg gcatgcctta tgccgtgacc gacgccgttc tggctcctca 10860 
tatcgggggg gaggctggga gctcacatgc cccgcccccg gccctcaccc tcatcttcga 10920 
ccgccatccc atcgccgccc tcctgtgcta cccggccgcg cggtacctta tgggcagcat 10980 
gaccccccag gccgtgctgg cgttcgtggc cctcatcccg ccgaccttgc ccggcaccaa 11040 
catcgtgctt ggggcccttc cggaggacag acacatcgac cgcctggcca aacgccagcg 11100 
ccccggcgag cggctggacc tggctatgct ggctgcgatt cgccgcgttt acgggctact 11160 
tgccaatacg gtgcggtatc tgcagtgcgg cgggtcgtgg cgggaggact ggggacagct 11220 
ttcggggacg gccgtgccgc cccagggtgc cgagccccag agcaacgcgg gcccacgacc 11280 
ccatatcggg gacacgttat ttaccctgtt tcgggccccc gagttgctgg cccccaacgg 11340 
cgacctgtat aacgtgtttg cctgggcctt ggacgtcttg gccaaacgcc tccgttccat 11400 
gcacgtcttt atcctggatt acgaccaatc gcccgccggc tgccgggacg ccctgctgca 11460 
acttacctcc gggatggtcc agacccacgt caccaccccc ggctccatac cgacgatatg 11520 
cgacctggcg cgcacgtttg cccgggagat gggggaggct aactgagggg atcgatccgt 11580 
cctgtaagtc tgcagaaatt gatgatctat taaacaataa agatgtccac taaaatggaa 11640 
gtttttcctg tcatactttg ttaagaaggg tgagaacaga gtacctacat tttgaatgga 11700 
aggattggag ctacgggggt gggggtgggg tgggattaga taaatgcctg ctctttactg 11760 
aaggctcttt actattgctt tatgataatg tttcatagtt ggatatcata atttaaacaa 11820 
gcaaaaccaa attaagggcc agctcattcc tcccactcat gatctataga tctatagatc 11880 
tctcgtggga tcattgtttt tctcttgatt cccactttgt ggttctaagt actgtggttt 11940 
ccaaatgtgt cagtttcata gcctgaagaa cgagatcagc agcctctgtt ccacatacac 12000 
ttcattctca gtattgtttt gccaagttct aattccatca gaagctgact ctaggccgag 12060 
ctccaattcg ccctatagtg agtcgtatta caattcactg gccgtcgttt tacaacgtcg 12120 
tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc 12180 
cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct 12240 
gaatggcgaa tgggacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac 12300 
gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc 12360 
ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt 12420 
agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt agggtgatgg 12480 
ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac 12540 
gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta tctcggtcta 12600 
ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat 12660 
ttaacaaaaa tttaacgcga attttaacaa aatattaacg cttacaattt aggtggcact 12720 
tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg 12780 
tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt 12840 
atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct 12900 
gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca 12960 
cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc 13020 
gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc 13080 
cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg 13140 
gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta 13200 
tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc 13260 
ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt 13320 
gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg 13380 
cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct 13440 
tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc 13500 
tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct 13560 
cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac 13620 
acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc 13680 
tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat 13740 
ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg 13800 
accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc 13860 
aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa 13920 
ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag 13980 
gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta 14040 
ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta 14100 
ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag 14160 
ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg 14220 
gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg 14280 
cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag 14340 
cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc 14400 
cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa 14460 
aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg 14520 
ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct 14580 
gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa 14640 
gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg 14700 
cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag 14760 
ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga 14820 
attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta cgccaagcgc 14880 
gcaattaacc ctcactaaag ggaacaaaag ctgtcgagat ctagatatcg atggccatag 14940 
agttacg 14947 



<210> 5 

<211> 4665 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Sequence of 
Rluc-Hl-shRNA neo insert 



<400> 5 

tctaggtaac 

tttccttgat 

caaactcttc 

tctaagtaag 

gaccccgagc 

atgaacgtgc 

gtgatttttc 

atcgagcccg 

cigcgggaatg 

ctgctgaacc 

tttcactact 

gtggacgtga 

aagagcgaag 

ccaagcaaga 

aaggagaagg 

aagggaggca 

agcgacgatc 

gtcgagggag 

agccaggagg 

ctgaagaacg 

cattattttc 

gaatgaggcg 

agaatgacct 

gccagggatc 

actactcaag 

attcctgcgc 

gtggcatccc 

gcccaccagc 

ctataatatt 

ctgtagggcc 

cactgcaatc 

gggattccag 

tttcaccata 

gcctcccaaa 

tttaaaataa 

ccaaccggtg 

tcagtagatg 

tatgctatac 

ccccccgcat 

ctccgtcgcc 

aatcgcgggc 

atggctgtga 

ggaaatcacc 

accactcttt 

ctctcgctga 

tatacgaagt 

tcggaatagg 

gtctggagca 

gcctcgcaca 



cgatatccct 
gatgtcatac 
gcggtctttc 
cttggcattc 
aacgcaaacg 
tggactcctt 
tgcatggtaa 
tggctagatg 
gctcatatcg 
ttccaaagaa 
cctacgagca 
tcgagtcctg 
agggcgagaa 
tcatgcggaa 
gcgaggttag 
agcccgacgt 
tgcctaagat 
ctaagaagtt 
acgctccaga 
agcagtaatt 
aatagatctg 
cggccaaggg 
tgggggaggg 
ttcaagcaga 
aactacgggc 
atcgtgcagt 
tgtgacccct 
cttgtcctaa 
atggggtgga 
tgcggggtct 
tccgcctcct 
gcatgcatga 
ttggccaggc 
ttgctgggat 
ctataccagc 
ggacatttga 
cctgttgaat 
gaagttatgg 
gccgtcccgc 
gccgcgccgc 
ccagtgtcac 
gggacagggg 
ataaacgtga 
cccaggattc 
gttggaatcc 
tatggcgcgc 
aacttcttag 
tgcgctttag 
cattccacat 



gcaggggtga 

ttatcctgtc 

cagtactcct 

cggtactgtt 

catgatcact 

catcaactac 

cgctgcctcc 

catcatccct 

cctcctggat 

aatcatcttt 

ccaagacaag 

ggacgagtgg 

aatggtgctt 

actggagcct 

acggcctacc 

cgtccagatt 

gttcatcgag 

ccctaacacc 

tgaaatgggt 

ctagaccggt 

tgtgttggtt 

ggagggggag 

ggaggccaga 

cctacagcaa 

tgctctactg 

gccgctctgt 

ccccagtgcc- 

taaaattaag 

ggggggtggt 

attgggaacc 

gggttcaagc 

ccaggctcag 

tggtctccaa 

tacaggcgtg 

aggaggacgt 

gttgcttgct- 

taagcttatt 

atcctcacag 

gatattgagc 

catggaattc 

taggcgggaa 

agtggcgccc 

aatgtctttg 

caattcagcg 

atttttttct 

cggtaaccga 

gtcaattcta 

cagccccgct 

ccaccggtag 



cctgcacgtc 
cctttttttt 
gcaggtgact 
ggtaaagcca 
gggcctcagt 
tatgattccg 
agctacctgt 
gatctgatcg 
cactacaagt 
gtgggccacg 
atcaaggcca 
cctgacatcg 
gagaataact 
gaggagttcg 
ctctcctggc 
gtccgcaact 
tccgaccctg 
gagttcgtga 
aagtacatca 
tcgagatcca 
ttttgtgtgc 
gccagaatga 
atgaggcgcg 
gttcgacaca 
cttcaggaag 
ggagggcagc 
tctcctggcc 
ttgcatcatt 
atggagcaag 
aagctggagt 
gattctcctg 
ctaatttttg 
ctcctaatct 
aaccactgct 
ccagacacag 
tggcactgtc 
taaataggcc 
taggtggcat 
tccgaacctc 
gaacgctgac 
cacccagcgc 
tgcaatattt 
gatttgggaa 
ggagccacct 
agactcgaga 
agttcctata 
ccgggtaggg 
gggcacttgg 
gcgccaaccg. 



tagggcgcag 
ccacagctcg 
gactgagtcg 
ccatggcttc 
ggtgggctcg 
agaagcacgc 
ggaggcacgt 
gaatgggtaa 
acctcaccgc 
actggggggc 
tcgtccatgc 
^ggaggatat 
tcttcgtcga 
ctgcctacct 
ctcgcgagat 
acaacgccta 
ggttcttttc 
aggtgaaggg 
agagcttcgt 
ggcgcggatc 
cttgggggag 
ccttggggga 
gatccgtcga 
aactcacaca 
gacatggaca 
tgtggcttct 
ctggaagttg 
ttgtctgact 
gggcaagttg 
gcagtggcac 
cctcagcctc 
tttttttggt 
caggtgatct 
cccttccctg 
cataggctac 
ctctcatgcg 
ggccataact 
cgttcctttc 
tcgccctgcc 
gtcatcaacc 
gcgtgcgccc 
gcatgtcgct 
tcttataagt 
gatgaagctt 
taacttcgta 
ctttctagag 
gaggcgcttt 
cgctacacaa 
gctccgttct 



tagtccaggg 
cggttgagga 
agatctgcga 
caaggtgtac 
ctgcaagcaa 
cgagaacgcc 
cgtgcctcac 
gtccggcaag 
ttggttcgag 
ttgtctggcc 
tgagagtgtc 
cgccctgatc 
gaccatgctc 
ggagccattc 
ccctctcgtt 
ccttcgggcc 
caacgctatt 
cctccacttc 
ggagcgcgtg 
aataaaagat 
ggggaggcca 
gggggaggcc 
cttaattaag 
acgatgacgc 
aggtcgagac 
agctgcccgg 
ccactccagt 
aggtgtcctt 
ggaagacaac 
aatcttggct 
ccgagttgtt 
agagacgggg 
acccaccttg 
tccttctgat 
ctggccatgc 
ttgggtccac 
tcgtataatg 
tgactgcccg 
gccgccggtg 
cgctccaagg 
tggcaggaag 
atgtgttctg 
tctgtatgag 
gatcgggtgg 
taatgtatgc 
aataggaact 
tcccaaggca 
gtggcctctg 
ttggtggccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340' 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 



WO 2004/035782



cttcgcgcca ccttctactc ctcccctagt caggaagttc ccccccgccc cgcagctcgc 3000
gtcgtgcagg acgtgacaaa tggaagtagc acgtctcact agtctcgtgc agatggacag 3060
caccgctgag caatggaagc gggtaggcct ttggggcagc ggccaatagc agctttgctc 3120
cttcgctttc tgggctcaga ggctgggaag gggtgggtcc gggggcgggc tcaggggcgg 3180
gctcaggggc ggggcgggcg cccgaaggtc ctccggaggc ccggcattct gcacgcttca 3240
aaagcgcacg tctgccgcgc tgttctcctc ttcctcatct ccgggccttt cgacctgcag 3300
ccaatatggg atcggccatt gaacaagatg gattgcacgc aggttctccg gccgcttggg 3360
tggagaggct attcggctat gactgggcac aacagacaat cggctgctct gatgccgccg 3420
tgttccggct gtcagcgcag gggcgcccgg ttctttttgt caagaccgac ctgtccggtg 3480
ccctgaatga actgcaggac gaggcagcgc ggctatcgtg gctggccacg acgggcgttc 3540
cttgcgcagc tgtgctcgac gttgtcactg aagcgggaag ggactggctg ctattgggcg 3600
aagtgccggg gcaggatctc ctgtcatctc accttgctcc tgccgagaaa gtatccatca 3660
tggctgatgc aatgcggcgg ctgcatacgc ttgatccggc tacctgccca ttcgaccacc 3720
aagcgaaaca tcgcatcgag cgagcacgta ctcggatgga agccggtctt gtcgatcagg 3780
atgatctgga cgaagagcat caggggctcg cgccagccga actgttcgcc aggctcaagg 3840
cgcgcatgcc cgacggcgag gatctcgtcg tgacccatgg cgatgcctgc ttgccgaata 3900
tcatggtgga aaatggccgc ttttctggat tcatcgactg tggccggctg ggtgtggcgg 3960
accgctatca ggacatagcg ttggctaccc gtgatattgc tgaagagctt ggcggcgaat 4020
gggctgaccg cttcctcgtg ctttacggta tcgccgctcc cgattcgcag cgcatcgcct 4080
tctatcgcct tcttgacgag ttcttctgag gggatcgatc cgctgtaagt ctgcagaaat 4140
tgatgatcta ttaaacaata aagatgtcca ctaaaatgga agtttttcct gtcatacttt 4200
gttaagaagg gtgagaacag agtacctaca ttttgaatgg aaggattgga gctacggggg 4260
tgggggtggg gtgggattag ataaatgcct gctctttact gaaggctctt tactattgct 4320
ttatgataat gtttcatagt tggatatcat aatttaaaca agcaaaacca aattaagggc 4380
cagctcattc ctcccactca tgatctatag atctatagat ctctcgtggg atcattgttt 4440
ttctcttgat tcccactttg tggttctaag tactgtggtt tccaaatgtg tcagtttcat 4500
agcctgaaga acgagatcag cagcctctgt tccacataca cttcattctc agtattgttt 4560
tgccaagttc taattccatc agaagctgac tctagatccc gcgccgaagt tcctatactt 4620
tctagagaat aggaacttcg gaataggaac ttcaagctta agcgc 4665



<210> 6
<211> 15199
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Targeting
vector for Rosa26 locus with Rluc-H1-shRNA neo
insert

<400> 6

ctagggataa cagggtaata tagccgcggc aggccctccg agcgtggtgg agccgttctg 60
tgagacagcc gggtacgagt cgtgacgctg gaaggggcaa gcgggtggtg ggcaggaatg 120
cggtccgccc tgcagcaacc ggagggggag ggagaaggga gcggaaaagt ctccaccgga 180
cgcggccatg gctcgggggg gggggggcag cggaggagcg cttccggccg acgtctcgtc 240
gctgattggc ttcttttcct cccgccgtgt gtgaaaacac aaatggcgtg ttttggttgg 300
cgtaaggcgc ctgtcagtta acggcagccg gagtgcgcag ccgccggcag cctcgctctg 360
cccactgggt ggggcgggag gtaggtgggg tgaggcgagc tggacgtgcg ggcgcggtcg 420
gcctctggcg gggcggggga ggggagggag ggtcagcgaa agtagctcgc gcgcgagcgg 480
ccgcccaccc tccccttcct ctgggggagt cgttttaccc gccgccggcc gggcctcgtc 540
gtctgattgg ctctcggggc ccagaaaact ggcccttgcc attggctcgt gttcgtgcaa 600
gttgagtcca tccgccggcc agcgggggcg gcgaggaggc gctcccaggt tccggccctc 660
ccctcggccc cgcgccgcag agtctggccg cgcgcccctg cgcaacgtgg caggaagcgc 720
gcgctggggg cggggacggg cagtagggct gagcggctgc ggggcgggtg caagcacgtt 780
tccgacttga gttgcctcaa gaggggcgtg ctgagccaga cctccatcgc gcactccggg 840
gagtggaggg aaggagcgag ggctcagttg ggctgttttg gaggcaggaa gcacttgctc 900
tcccaaagtc gctctgagtt gttatcagta agggagctgc agtggagtag gcggggagaa 960
ggccgcaccc ttctccggag gggggagggg agtgttgcaa tacctttctg ggagttctct 1020
gctgcctcct ggcttctgag gaccgccctg ggcctgggag aatcccttcc ccctcttccc 1080
tcgtgatctg caactccagt ctttctaggt aaccgatatc cctgcagggg tgacctgcac 1140
gtctagggcg cagtagtcca gggtttcctt gatgatgtca tacttatcct gtcccttttt 1200
tttccacagc tcgcggttga ggacaaactc ttcgcggtct ttccagtact cctgcaggtg 1260

actgactgag
ccaccatggc 
agtggtgggc 
ccgagaagca 
tgtggaggca 
tcggaatggg 
agtacctcac 
acgactgggg 
ccatcgtcca 
tcgaggagga 
acttcttcgt 
tcgctgccta
ggcctcgcga 
actacaacgc 
ctgggttctt 
tgaaggtgaa 
tcaagagctt 
ccaggcgcgg 
tgccttgggg 
tgaccttggg 
gcggatccgt 
acaaactcac 
aaggacatgg 
agctgtggct 
gccctggaag 
attttgtctg 
aaggggcaag 
agtgcagtgg 
ctgcctcagc 
ttgttttttt 
tctcaggtga 
gctcccttcc 
cagcataggc 
gtcctctcat 
gccggccata 
catcgttcct 
ctctcgccct 
gacgtcatca 
cgcgcgtgcg 
tttgcatgtc 
gaatcttata 
cctgatgaag 
agataacttc 
atactttcta 
ggggaggcgc 
tggcgctaca 
ccggctccgt 
ttcccccccg 
actagtctcg 
agcggccaat 
tccgggggcg 
ggcccggcat 
tctccgggcc 
cgcaggttct 
aatcggctgc 
tgtcaagacc 
gtggctggcc 
aagggactgg 
tcctgccgag 
ggctacctgc 
ggaagccggt 
cgaactgttc 
tggcgatgcc 



tcgagatctg 
ttccaaggtg 
tcgctgcaag 
cgccgagaac 
cgtcgtgcct 
taagtccggc 
cgcttggttc 
ggcttgtctg 
tgctgagagt 
tatcgccctg 
cgagaccatg 
cctggagcca 
gatccctctc 
ctaccttcgg 
ttccaacgct 
gggcctccac 
cgtggagcgc 
atcaataaaa 
gagggggagg 
ggagggggag 
cgacttaatt 
acaacgatga 
acaaggtcga 
tctagctgcc 
ttgccactcc 
actaggtgtc 
ttgggaagac 
cacaatcttg 
ctcccgagtt 
ggtagagacg 
tctacccacc 
ctgtccttct 
tacctggcca 
gcgttgggtc 
acttcgtata 
ttctgactgc 
gccgccgccg 
acccgctcca 
ccctggcagg 
gctatgtgtt 
agttctgtat 
cttgatcggg 
gtataatgta 
gagaatagga 
ttttcccaag 
caagtggcct 
tctttggtgg 
ccccgcagct 
tgcagatgga 
agcagctttg 
ggctcagggg 
tctgcacgct 
tttcgacctg 
ccggccgctt 
tctgatgccg 
gacctgtccg 
acgacgggcg 
ctgctattgg 
aaagtatcca 
ccattcgacc 
cttgtcgatc 
gccaggctca 
tgcttgccga 



cgatctaagt 

tacgaccccg 

caaatgaacg 

gccgtgattt 

cacatcgagc 

aagagcggga 

gagctgctga 

gcctttcact 

gtcgtggacg 

atcaagagcg 

ctcccaagca 

ttcaaggaga 

gttaagggag 

gccagcgacg 

attgtcgagg 

ttcagccagg 

gtgctgaaga 

gatcattatt 

ccagaatgag 

gccagaatga 

aaggccaggg 

cgcactactc 

gacattcctg 

cgggtggcat 

agtgcccacc 

cttctataat 

aacctgtagg 

gctcactgca 

gttgggattc 

gggtttcacc 

ttggcctccc

gattttaaaa 

tgcccaaccg 

cactcagtag 

atgtatgcta 

ccgccccccg 

gtgctccgtc 

aggaatcgcg 

aagatggctg 

ctgggaaatc 

gagaccactc 

tggctctcgc 

tgctatacga 

acttcggaat 

gcagtctgga 

ctggcctcgc 

ccccttcgcg 

cgcgtcgtgc 

cagcaccgct 

ctccttcgct 

cgggctcagg 

tcaaaagcgc

cagccaatat 

gggtggagag 

ccgtgttccg 

gtgccctgaa 

ttccttgcgc 

gcgaagtgcc 

tcatggctga 

accaagcgaa 

aggatgatct 

aggcgcgcat 

atatcatggt 




aagcttggca 

agcaacgcaa 

tgctggactc 

ttctgcatgg 

ccgtggctag 

atggctcata 

accttccaaa 

actcctacga 

tgatcgagtc 

aagagggcga 

agatcatgcg 

agggcgaggt 

gcaagcccga 

atctgcctaa 

gagctaagaa 

aggacgctcc 

acgagcagta 

ttcaatagat 

gcgcggccaa 

ccttggggga 

atcttcaagc 

aagaactacg 

cgcatcgtgc 

ccctgtgacc 

agccttgtcc 

attatggggt 

gcctgcgggg 

atctccgcct 

caggcatgca 

atattggcca 

aaattgctgg 

taactatacc 

gtgggacatt 

atgcctgttg 

tacgaagtta 

catgccgtcc 

gccgccgcgc 

ggcccagtgt 

tgagggacag 

accataaacg 

tttcccagga 

tgagttggaa 

agttatggcg 

aggaacttct 

gcatgcgctt 

acacattcca 

ccaccttcta 

aggacgtgac 

gagcaatgga 

ttctgggctc 

ggcggggcgg 

acgtctgccg 

gggatcggcc 

gctattcggc 

gctgtcagcg 

tgaactgcag 

agctgtgctc 

ggggcaggat 

tgcaatgcgg 

acatcgcatc 

ggacgaagag

gcccgacggc 

ggaaaatggc
ttccggtact

acgcatgatc 

cttcatcaac 

taacgctgcc 

atgcatcatc 

tcgcctcctg 

gaaaatcatc 

gcaccaagac 

ctgggacgag 

gaaaatggtg 

gaaactggag 

tagacggcct

cgtcgtccag 

gatgttcatc 

gttccctaac 

agatgaaatg 

attctagacc 

ctgtgtgttg 

gggggagggg 

gggggaggcc 

agacctacag 

ggctgctcta 

agtgccgctc

cctccccagt 

taataaaatt 

ggaggggggt 

tctattggga 

cctgggttca 

tgaccaggct 

ggctggtctc 

gattacaggc 

agcaggagga 

tgagttgctt 

aattaagctt 

tggatcctca 

cgcgatattg 

cgccatggaa 

cactaggcgg 

gggagtggcg 

tgaaatgtct 

ttccaattca 

tccatttttt 

cgccggtaac 

taggtcaatt 

tagcagcccc 

catccaccgg 

ctcctcccct 

aaatggaagt 

agcgggtagg 

agaggctggg 

gcgcccgaag 

cgctgttctc 

attgaacaag 

tatgactggg 

caggggcgcc 

gacgaggcag

gacgttgtca 

ctcctgtcat 

cggctgcata 

gagcgagcac 

catcaggggc 

gaggatctcg 

cgcttttctg 



gttggtaaag 
actgggcctc 
tactatgatt 
tccagctacc 
cctgatctga 
gatcactaca 
tttgtgggcc 
aagatcaagg 
tggcctgaca 
cttgagaata 
cctgaggagt 
accctctcct 
attgtccgca 
gagtccgacc 
accgagttcg 
ggtaagtaca 
ggttcgagat 
gttttttgtg 
gaggccagaa 
agaatgaggc 
caagttcgac 
ctgcttcagg 
tgtggagggc 
gcctctcctg 
aagttgcatc 
ggtatggagc 
accaagctgg 
agcgattctc 
cagctaattt 
caactcctaa 
gtgaaccact 
cgtccagaca 
gcttggcact 
atttaaatag 
cagtaggtgg 
agctccgaac 
ttcgaacgct 
gaacacccag 
ccctgcaata 
ttggatttgg 

gcgggagcca 

tctagactcg 
cgaagttcct 
ctaccgggta 
gctgggcact 
taggcgccaa 
agtcaggaag 
agcacgtctc 
cctttggggc 
aaggggtggg 
gtcctccgga 
ctcttcctca 
atggattgca 
cacaacagac 
cggttctttt 
cgcggctatc 
ctgaagcggg 
ctcaccttgc 
cgcttgatcc 
gtactcggat 
tcgcgccagc 
tcgtgaccca 
gattcatcga 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620
4680
4740
4800
4860
4920
4980
5040

ctgtggccgg
tgctgaagag
tcccgattcg 
atccgctgta 
ggaagttttt 
tggaaggatt 
actgaaggct 
acaagcaaaa 
gatctctcgt 
gtttccaaat 
acacttcatt 
cccgcgccga 
ttaagcgcta 
gggcgttgtc 
gattttcggt 
gtagtcatct 
ggagtatttt 
ttgagatcct 
tttaatcatt 
ttatcaaaag 
atccaacccc 
tcatgaaacc 
acactgcagt 
taattttcag 
gtccagggag 
ctcccatgaa 
gttccaggtg 
tattttaaga 
actaaaattt 
aggtgtttgt 
agtgctttct 
gtgattttgc 
taaaatgatt 
gtgcttgtgc 
cacagactgc 
accatctgta 
acattaaata 
tccagtagaa 
aatcaaatta 
taactgtgga 
tttaatgtct 
tatttgttta 
ctaagtctgc 
tgtagggcca 
actctggagg 
gttctatcta 
atttcataaa 
gtctaccttt 
tttgacactg 
taacttccca 
gcagcctgat 
ttttttttta 
ggactaatat 
cttaactttt 
agaaactgtt 
tcacagccaa 
ataaaaatga 
caagatatgt 
agtaatactt 
gggtgatagt 
ctacttttta 
gctgagcaca 
aggcagacaa 



ctgggtgtgg 
cttggcggcg 
cagcgcatcg 
agtctgcaga 
cctgtcatac 
ggagctacgg 
ctttactatt 
ccaaattaag 
gggatcattg 
gtgtcagttt 
ctcagtattg 
agttcctata 
gaagatgggc 
ctgcagggga 
tttgtcggga 
ggggttttat 
ccatcgaggt 
tactacagta 
tttaaagagc 
gtattttaga 
tagacagagc 
agacagatta 
tcttttataa 
tgtctatcac 
ttttacaaca 
tgcctctctc 
gatgtctcct 
cattaaaagg 
tgtcattgta 
tgtctttcct 
cttggactgg 
caagcttctt 
aagcaacaaa 
tctataataa 
tcttccagaa 
atgggatctg 
aataaatcct 
tttactgaag 
ctctttaagc 
cagaggagcc 
tttctcttac 
aactgctagc 
agccattact 
gagtttagcc 
cagagacagg 
ggatagccag 
attataattg 
ctgatgagat 
tgggctttct 
tggcttaaat 
ttccagggtg 
aaaaatgggt 
taaatgagtc 
tttttaaatg 
gcatcgctta 
agtcaaatta 
tagcttttcc 
agactaaagt 
ttggaacttg 
tggtaaaatg 
aaaaaaaaag 
caagagtagt 
ccaagaaact



cggaccgcta 
aatgggctga 
ccttctatcg 
aattgatgat 
tttgttaaga 
gggtgggggt 
gctttatgat 
ggccagctca 
tttttctctt 
catagcctga 
ttttgccaag 
ctttctagag 

gggagtcttc 

attgaacagg 

agttttttaa 

gcagcaaaac 

agattaaaga 

tgaaattaca 

ccagtacttc 

acactcattt 

attggcattt 

gttacataca

ctccttagta 

ctctcccgtc 

atagatgtat 

ctttttctcc 

ccccatatta 

tatatttcat 

cacatctgta 

gacctaaggt 

cttgactcat 

caggacctat 

tgtatttgtg 

tactatccag 

gtcctgagtt 

atgccctctt 

ccttcttctt 

taatgaaata 

actggaaatg 

ataactgcag 

actaagcaat 

tttacttaac 

aaacatgaaa 

agccagtggt 

cagatctctg 

gaatacacac 

aagcattccc 

ttgggtatta 

ttaaagcctc 

ggcatggctt 

gggttgggaa 

tatataataa 

cctcccctat 

tggtatcttt 

atcagatttt 

agaatttctg 

tgaggcaggg 

tctgcctgct 

caggtcagat 

tgtttcaagt 

ccaggcctgt 

tacttggcag 

acagttaagg 




tcaggacata 

ccgcttcctc 

ccttcttgac 

ctattaaaca

agggtgagaa 

ggggtgggat 

aatgtttcat

ttcctcccac 

gattcccact 

agaacgagat 

ttctaattcc 

aataggaact 

tgggcaggct 

tgtaaaattg 

taggggcaaa 

tacaggttat 

catgctcacc 

gtgtcgcgag 

atatccattt 

tagccccatt 

tccctttcct 

ccacaaatcg 

cactttttgt 

agtggtgttc 

tgagaatcca 

atttataaac 

cctgatgtat 

tattgagcca 

aaaggtggtt 

cttgtgagct 

ggcattctac 

aattttgctt 

aagcttggtt 

gggctggaga 

caattcccag 

ctggtgtgtc 

cttttttttt 

ctttgtgttt 

ttaccaagga 

acttgtggga 

aaagaaataa 

ttttgtgctt 

gcaagtaatg 

ggtgcttgcc 

agtttgagcc 

agaaaccctg 

taatgagcca 

ttttttctgt 

cttcctgcca 

tttgccttct 

atctttcaaa 

acctcataaa 

aaaagagcta 

agaaccaagg 

ctagtttcaa 

acttttaatg 

tctcactatg 

tttgtctcct 

tcttttatag 

gatgaaaact 

tagagcatgc 

gctcctggtg 

ttacctgtct
gcgttggcta

gtgctttacg 

gagttcttct 

ataaagatgt 

cagagtacct 

tagataaatg 

agttggatat 

tcatgatcta 

ttgtggttct 

cagcagcctc 

atcagaagct 

tcggaatagg 

taaaggctaa 

gagggacaag 

taaggaaaat 

tattgcttgt 

cgagttttat 

ttagactatg 

ctcccgctcc 

ttcatttatt 

gatcttagaa 

aggctgtagc 

tgatcctttg 

cacatttggg 

acctaaagct 

tgagctatta 

cttacatatt

catggtattg 

ccttttggaa 

tgtatttttt 

acgttattgc 

gacttgtagc 

tttaggttgt 

ggtggctcgg 

caaccacatg 

tgaagaccac 

tttttaaaga 

gttccaatat 

actaattttt 

tacagaagac 

aaattgaact 

catctataca 

ataattttgg 

tttatgcctt 

cagcctggtc 

ttggggaggg 

ctatggatgt 

ctctgctgtt 

tgtggtctct 

aagggcagct 

cactaaaatt 

atagttatga 

ttaaggcttt 

gtcttagagt 

atccagagaa 

ttaatttgct 

tatctctgcc 

gaatactaag 

gggacacact 

tgaattatta 

ttaagggatc 

agagcatatt 

ttaaaccatc 



cccgtgatat 
gtatcgccgc 
gaggggatcg 
ccactaaaat 
acattttgaa 
cctgctcttt 
cataatttaa 
tagatctata 
aagtactgtg 
tgttccacat 
gactctagat 
aacttcaagc 
cctggtgtgt 
acttcccaca 
gggaggatag 
gatccgcctc 
actctcctgc 
taagcagaat 
ttctgcagcc 
atactggctt 
gtctgatgac 
tggggcctca 
ccttgatcct 
cctattctca 
taactttcca 
accattaatg 
gccaggctga 
attactgctt 
tgcaaagttc 
ctatttaagc 
tggtctaaat 
caaacacaag 
tgtgttgtgt 
agttcaagag 
gtggctcaca 
aagtgtattc 
gaatactgtc 
ggtagccaat 
atttgaagtg 
caatgcagac 
tctagtatcc 
aagctgaaag 
atttcaaaaa 
taatcccagc 
tacacatcaa 
gggctctgag 
ggctaaatcc 
ggttgggtct 
tgtttgctac 
gctgagattt 
gtcctttaat 
ggagtgaggt 
ttgtcttata 
tttagtatac 
tccaaattct 
tactgtgaat 
tgatctgcaa 
gttaaaatgt 
aagggagctt 
tcaccgcaac 
cctaggactt 
tcaaaaaaca 
tgcatataca 



5100 
5160 
5220 
5280 
5340
5400
5460
5520
5580
5640
5700
5760
5820
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200
7260
7320
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820

cagggatatt
catggatttc 
tgtgaaattg 
cgaggtctct 
atgtgctagg 
gatgtcatat 
taatgtgaaa 
agtcattttt 
tctctgacct 
gcatttcacc 
atgtttatcc 
agcactgtta 
aaataatgct 
ttttgttcaa 
catatttttt 
gccctggctg 
ctgcctctgc 
gatattttgt 
cagtagtctt 
ggctgttggt 
pagggataga 
atagaaatat 
ggcaggtcaa 
tttagcagcc 
cacatccacc 
tactcctccc 
acaaatggaa 
gaagcgggta 
tcagaggctg 
gggcgcccga 
cgcgctgttc 
tgccatcatg 
ttctcgcggc 
ggaagtccgc 
cgggatgggg 
cgtctacgta 
gaacatctac 
ggtggtaatg 
tctggctcct 
cctcatcttc 
tatgggcagc 
gcccggcacc 
caaacgccag 
ttacgggcta 
ctggggacag 
gggcccacga 
ggcccccaac 
cctccgttcc 
cgccctgctg 
accgacgata 
ggatcgatcc 
actaaaatgg 
attttgaatg 
tgctctttac 
taatttaaac 
gatctataga 
gtactgtggt 
ttccacatac 
ctctaggccg 
tttacaacgt 
tccccctttc 
gttgcgcagc 
tgtggtggtt 



aaaatattcc 
tccggtgaat 
ttttcaagtg 
tggtttatac 
cctaccttag 
gatttcatat 
gttaggtctt 
acatgtctta 
agtaacccta 
ttgaagttga 
agaagatatt 
gtgagcattg 
ggcctttttt 
agaaatactt 
ttcctttttt 
tcctggaact 
ctcctgagtg 
tatataacta 
aagtggtctt 
actagtggca 
aactagtcta 
ttcaggacat 
ttctaccggg 
ccgctgggca 
ggtaggcgcc 
ctagtcagga 
gtagcacgtc 
ggcctttggg 
ggaaggggtg 
aggtcctccg
tcctcttcct 
gcctcgtacc 
catagcaacc 
ccggagcaga 
aaaaccacca 
cccgagccga 
accacacaac 
acaagcgccc 
catatcgggg 
gaccgccatc 
atgacccccc 
aacatcgtgc 
cgccccggcg 
cttgccaata 
ctttcgggga 
ccccatatcg 
ggcgacctgt 
atgcacgtct 
caacttacct 
tgcgacctgg 
gtcctgtaag 
aagtttttcc 
gaaggattgg 
tgaaggctct 
aagcaaaacc 
tctctcgtgg 
ttccaaatgt 
acttcattct 
agctccaatt 
cgtgactggg 
gccagctggc 
ctgaatggcg 
acgcgcagcg 



aaataatatt 
aggcagagtt 
atagttaaag 
tcagaagcac 
gctgattgtt 
atcaaggcaa 
tgtgggtttg 
catttggtgg 
cctatagagc 
gacgttttgt 
caggactatt 
agtggccttt 
aaaaagccct 
gtttggatct 
tttttttttt 
cactttgtag 
ccgggattaa 
taaccaatac 
tattggccct 
cttaagcaac 
gcgttttgta 
ctagcacgtg 
taggggaggc 
cttggcgcta 
aaccggctcc 
agttcccccc 
tcactagtct 
gcagcggcca 
ggtccggggg 
gaggcccggc 
catctccggg 
ccggccatca 
gacgtacggc 
aaatgcccac 
ccacgcaact 
tgacttactg 
accgcctcga 
agataacaat 
gggaggctgg 
ccatcgccgc 
aggccgtgct 
ttggggccct 
agcggctgga 
cggtgcggta 
cggccgtgcc 
gggacacgtt 
ataacgtgtt 
ttatcctgga 
ccgggatggt 
cgcgcacgtt 
tctgcagaaa 
tgtcatactt 
agctacgggg 
ttactattgc 
aaattaaggg 
gatcattgtt 
gtcagtttca
cagtattgtt 
cgccctatag 
aaaaccctgg 
gtaatagcga 
aatgggacgc 
tgaccgctac 




tcattcaagt 
ggaaactaaa 
cccatgagat 
ttctttgggt 
gttcaaataa 
aacatgttat 
atttttaatt 
aattgtataa 
actttgctgg 
tagtgtatac 
tttgactggg 
aggcttgaat 
tgttctttat 
ccttttgaca 
tggtttttcg 
accaggctgg 
aggcgtgcac 
taactccact 
tcattaaaat 
ttGctacgga 
tacctaccag 
ttaactcgag 
gcttttccca 
cacaagtggc 
gttctttggt 
cgccccgcag 
cgtgcagatg 
atagcagctt 
cgggctcagg 
attctgcacg 
cctttcgacc 
acacgcgtct 
gttgcgccct 
gctactgcgg 
gctggtggcc 
gcgggtgctg 
ccagggtgag 
gggcatgcct 
gagctcacat 
cctcctgtgc 
ggcgttcgtg 
tccggaggac
cctggctatg 
tctgcagtgc 
gccccagggt 
atttaccctg 
tgcctgggcc 
ttacgaccaa 
ccagacccac 
tgcccgggag 
ttgatgatct 
tgttaagaag 
gtgggggtgg 
tttatgataa 
ccagctcatt 
tttctcttga 
tagcctgaag 
ttgccaagtt 
tgagtcgtat 
cgttacccaa 
agaggcccgc 
gccctgtagc 
acttgccagc 



tttcccccat 
caaatgttgg 
acagaacaaa
ttccctgcac 
acttaagttt 
atatgttaaa 
ttcaaaacct 
ttgtggtttg 
gtcacaagtc 
tagtttatat 
ctaaggaatt 
tggagtcact 
caccctgttt 
acaatagcat 
agacagggtt 
cctcgaactc 
caccacgcct 
gggtggattt 
ctactgttca 
tatactagca 
ctttatacta 
ctgcaggatt 
aggcagtctg 
ctctggcctc 
ggccccttcg 
ctcgcgtcgt 
gacagcaccg
tgctccttcg 
ggcgggctca 
cttcaaaagc 
tgcagccaat 
gcgttcgacc 
cgccggcagc 
gtttatatag 
ctgggttcgc 
ggggcttccg 
atatcggccg 
tatgccgtga 
gccccgcccc 
tacccggccg 
gccctcatcc 
agacacatcg 
ctggctgcga 
ggcgggtcgt 
gccgagcccc 
tttcgggccc 
ttggacgtct 
tcgcccgccg 
gtcaccaccc 
atgggggagg 
attaaacaat 
ggtgagaaca 
ggtgggatta 
tgtttcatag 
cctcccactc 
ttcccacttt 
aacgagatca 
ctaattccat 
tacaattcac 
cttaatcgcc 
accgatcgcc 
ggcgcattaa 
gccctagcgc 



caaattggga 
ttttgtgatt 
gctgctattt 
tatcctgatc 
cctgtcaggt 
catttgtact 
gagctaaata 
caggcaagac 
taggagtcaa 
gttggaggac 
gattctgatt 
tgtatatctc 
tctacataat 
gttttcaagc 
tctctgtata 
agaaatccgc 
ggctaagttg 
ttaattcagt 
ctctaacaga 
gattaagggt 
ccttgttctg 
cgagggcccc 
gagcatgcgc 
gcacacattc 
cgccaccttc 
gcaggacgtg 
ctgagcaatg 
ctttctgggc 
ggggcggggc 
gcacgtctgc 
gcaccgtcct 
aggctgcgcg 
aagaagccac 
acggtcccca 
gcgacgatat 
agacaatcgc 
gggacgcggc 
ccgacgccgt 
cggccctcac 
cgcggtacct 
cgccgacctt 
accgcctggc 
ttcgccgcgt 
ggcgggagga 
agagcaacgc 
ccgagttgct 
tggccaaacg 
gctgccggga 
ccggctccat 
ctaactgagg 
aaagatgtcc 
gagtacctac 
gataaatgcc 
ttggatatca
atgatctata 
gtggttctaa 
gcagcctctg 
cagaagctga 
tggccgtcgt 
ttgcagcaca 
cttcccaaca 
gcgcggcggg 
ccgctccttt

8880
8940
9000
9060
9120
9180
9240
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900
9960
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220
11280 
11340 
11400 
11460 
11520 
11580 
11640
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600

cgctttcttc
ggggctccct 
ttagggtgat 
gttggagtcc 
tatctcggtc 
aaatgagctg 
ttaggtggca 
cattcaaata 
aaaaggaaga 
ttttgccttc 
cagttgggtg 
agttttcgcc 
gcggtattat 
cagaatgact 
gtaagagaat 
ctgacaacga 
gtaactcgcc 
gacaccacga 
cttactctag 
ccacttctgc
gagcgtgggt 
gtagttatct 
gagataggtg 
ctttagattg 
gataatctca 
gtagaaaaga 
caaacaaaaa 
ctttttccga 
tagccgtagt 
ctaatcctgt 
tcaagacgat 
cagcccagct 
gaaagcgcca 
ggaacaggag 
gtcgggtttc 
agcctatgga 
tttgctcaca 
tttgagtgag 
gaggaagcgg 
taatgcagct 
aatgtgagtt 
atgttgtgtg 
tacgccaagc 
cgatggccat 



ccttcctttc
ttagggttcc 
ggttcacgta 
acgttcttta 
tattcttttg 
atttaacaaa 
cttttcgggg 
tgtatccgct 
gtatgagtat 
ctgtttttgc 
cacgagtggg 
ccgaagaacg 
cccgtattga 
tggttgagta 
tatgcagtgc 
tcggaggacc 
ttgatcgttg 
tgcctgtagc 
cttcccggca 
gctcggccct 
ctcgcggtat
acacgacggg 
cctcactgat 
atttaaaact 
tgaccaaaat 
tcaaaggatc 
aaccaccgct 
aggtaactgg 
taggccacca 
taccagtggc 
agttaccgga
tggagcgaac 
cgcttcccga 
agcgcacgag 
gccacctctg 
aaaacgccag 
tgttctttcc 
ctgataccgc 
aagagcgccc 
ggcacgacag 
agctcactca 
gaattgtgag 
gcgcaattaa 
agagttacg 



tcgccacgtt 

gatttagtgc 

gtgggccatc 

atagtggact 

atttataagg 

aatttaacgc 

aaatgtgcgc 

catgagacaa 

tcaacatttc 

tcacccagaa 

ttacatcgaa 

ttttccaatg 

cgccgggcaa 

ctcaccagtc 

tgccataacc 

gaaggagcta 

ggaaccggag 

aatggcaaca 

acaattaata 

tccggctggc

cattgcagca 

gagtcaggca 

taagcattgg 

tcatttttaa 

cccttaacgt 

ttcttgagat 

accagcggtg 

cttcagcaga 

cttcaagaac 

tgctgccagt 

taaggcgcag 

gacctacacc 

agggagaaag 

ggagcttcca 

acttgagcgt 

caacgcggcc 

tgcgttatcc 

tcgccgcagc 

aatacgcaaa 

gtttcccgac 

ttaggcaccc 

cggataacaa 

ccctcactaa 




cgccggcttt 
tttacggcac 
gccctgatag 
cttgttccaa 
gattttgccg 
gaattttaac 
ggaaccccta 
taaccctgat 
cgtgtcgccc 
acgctggtga 
ctggatctca 
atgagcactt 
gagcaactcg 
acagaaaagc 
atgagtgata 
accgcttttt 
ctgaatgaag 
acgttgcgca 
gactggatgg 
tggtttattg 
ctggggccag 
actatggatg 
taactgtcag 
tttaaaagga 
gagttttcgt 
cctttttttc 
gtttgtttgc 
gcgcagatac 
tctgtagcac 
ggcgataagt 
cggtcgggct 
gaactgagat 
gcggacaggt 
gggggaaacg 
cgatttttgt 
tttttacggt 
cctgattctg 
cgaacgaccg
ccgcctctcc 
tggaaagcgg 
caggctttac 
tttcacacag 
agggaacaaa
ccccgtcaag
ctcgacccca 
acggtttttc 
actggaacaa 
atttcggcct 
aaaatattaa 
tttgtttatt 
aaatgcttca
ttattccctt 
aagtaaaaga 
acagcggtaa 
ttaaagttct 
gtcgccgcat 
atcttacgga 
acactgcggc 
tgcacaacat 
ccataccaaa 
aactattaac 
aggcggataa
ctgataaatc 
atggtaagcc 
aacgaaatag 
accaagttta 
tctaggtgaa 
tccactgagc 
tgcgcgtaat 
cggatcaaga 
caaatactgt 
cgcctacata 
cgtgtcttac 
gaacgggggg 
acctacagcg 
atccggtaag 
cctggtatct 
gatgctcgtc 
tcctggcctt 
tggataaccg 
agcgcagcga 
ccgcgcgttg 
gcagtgagcg 
actttatgct 
gaaacagcta 
agctgtcgag 



ctctaaatcg 
aaaaacttga 
gccctttgac 
cactcaaccc 
attggttaaa 
cgcttacaat 
tttctaaata 
ataatattga 
ttttgcggca 
tgctgaagat 
gatccttgag 
gctatgtggc 
acactattct 
tggcatgaca 
caacttactt 

gggggatcat 

cgacgagcgt 
tggcgaacta 
agttgcagga 
tggagccggt 
ctcccgtatc 
acagatcgct 
ctcatatata 
gatccttttt 
gtcagacccc
ctgctgcttg 
gctaccaact 
ccttctagtg 
cctcgctctg 
cgggttggac 
ttcgtgcaca 
tgagctatga 
cggcagggtc 
ttatagtcct 
aggggggcgg 
ttgctggcct 
tattaccgcc 
gtcagtgagc 
gccgattcat 
caacgcaatt 
tccggctcgt 
tgaccatgat 
atctagatat 



12660

12720

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15199 



<210> 7
<211> 4640
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Sequence of
Rluc-U6-shRNA neo insert

<400> 7
tctaggtaac cgatatccct gcaggggtga cctgcacgtc tagggcgcag tagtccaggg 60
tttccttgat gatgtcatac ttatcctgtc cctttttttt ccacagctcg cggttgagga 120
caaactcttc gcggtctttc cagtactcct gcaggtgact gactgagtcg agatctgcga 180
tctaagtaag cttggcattc cggtactgtt ggtaaagcca ccatggcttc caaggtgtac 240
gaccccgagc aacgcaaacg catgatcact gggcctcagt ggtgggctcg ctgcaagcaa 300
atgaacgtgc tggactcctt catcaactac tatgattccg agaagcacgc cgagaacgcc 360
gtgatttttc tgcatggtaa cgctgcctcc agctacctgt ggaggcacgt cgtgcctcac 420

atcgagcccg
agcgggaatg 
ctgctgaacc 
tttcactact 
gtggacgtga 
aagagcgaag 
ccaagcaaga 
aaggagaagg 
aagggaggca 
agcgacgatc 
gtcgagggag 
agccaggagg 
ctgaagaacg 
cattattttc 
gaatgaggcg 
agaatgacct 
gccagggatc 
actactcaag 
attcctgcgc 
gtggcatccc 
gcccaccagc 
ctataatatt 
ctgtagggcc
cactgcaatc 
gggattccag 
tttcaccata 
gcctcccaaa 
tttaaaataa 
ccaaccggtg 
tcagtagatg 
tatgctatac 
ggagcgtgac 
ccttcatatt 
gtaaacacaa 
tttgcagttt 
tatttcgatt 
cagcgggagc 
tttctagact 
accgaagttc 
ttctaccggg 
ccgctgggca 
ggtaggcgcc 
ctagtcagga
gtagcacgtc 
ggcctttggg 
ggaaggggtg 
aggtcctccg 
tcctcttcct 
agatggattg 
ggcacaacag 
cccggttctt 
agcgcggcta 
cactgaagcg 
atctcacctt 
tacgcttgat
acgtactcgg 
gctcgcgcca 
cgtcgtgacc 
tggattcatc 
tacccgtgat 
cggtatcgcc 
ctgaggggat 
gtccactaaa 



tggctagatg 
gctcatatcg 
ttccaaagaa 
cctacgagca 
tcgagtcctg 
agggcgagaa 
tcatgcggaa 
gcgaggttag 
agcccgacgt 
tgcctaagat 
ctaagaagtt 
acgctccaga 
agcagtaatt 
aatagatctg 
cggccaaggg 
tgggggaggg 
ttcaagcaga 
aactacgggc 
atcgtgcagt 
tgtgacccct 
cttgtcctaa 
atggggtgga
tgcggggtct 
tccgcctcct 
gcatgcatga 
ttggccaggc 
ttgctgggat 
ctataccagc 
ggacatttga 
cctgttgaat 
gaagttatgg 
cgcgcgccga 
tgcatatacg 
agatattagt 
taaaattatg 
tcttggcttt 
cacctgatga 
cgagataact 
ctatactttc 
taggggaggc 
cttggcgcta 
aaccggctcc 
agttcccccc 
tcactagtct 
gcagcggcca 
ggtccggggg 
gaggcccggc 
catctccggg 
cacgcaggtt 
acaatcggct 
tttgtcaaga 
tcgtggctgg 
ggaagggact 
gctcctgccg 
ccggctacct 
atggaagccg 
gccgaactgt 
catggcgatg 
gactgtggcc 
attgctgaag 
gctcccgatt 
cgatccgctg 
atggaagttt 



catcatccct 

cctcctggat 

aatcatcttt 

ccaagacaag 

ggacgagtgg 

aatggtgctt 

actggagcct 

acggcctacc 

cgtccagatt 

gttcatcgag 

ccctaacacc 

tgaaatgggt 

ctagaccggt 

tgtgttggtt 

ggagggggag 

ggaggccaga 

cctacagcaa 

tgctctactg 

gccgctctgt 

ccccagtgcc 

taaaattaag 

ggggggtggt 

attgggaacc 

gggttcaagc 

ccaggctcag 

tggtctccaa 

tacaggcgtg 

aggaggacgt 

gttgcttgct 

taagcttatt 

atccagtgga 

gcccaaggtc 

atacaaggct 

acaaaatacg 

ttttaaaatg 

atatatcttg 

agcttgatcg 

tcgtataatg 

tagagaatag 

gcttttccca 

cacaagtggc 

gttctttggt 

cgccccgcag 

cgtgcagatg 

atagcagctt 

cgggctcagg 

attctgcacg 

cctttcgacc 

ctccggccgc

gctctgatgc 

ccgacctgtc 

ccacgacggg 

ggctgctatt 

agaaagtatc 

gcccattcga 

gtcttgtcga 

tcgccaggct 

cctgcttgcc 

ggctgggtgt 

agcttggcgg 

cgcagcgcat 

taagtctgca 

ttcctgtcat 




gatctgatcg 
cactacaagt 
gtgggccacg 
atcaaggcca 
cctgacatcg 
gagaataact 
gaggagttcg 
ctctcctggc 
gtccgcaact 
tccgaccctg 
gagttcgtga 
aagtacatca 
tcgagatcca 
ttttgtgtgc 
gccagaatga 
atgaggcgcg 
gttcgacaca 
cttcaggaag 
ggagggcagc 
tctcctggcc 
ttgcatcatt 
atggagcaag 
aagctggagt 
gattctcctg 
ctaatttttg 
ctcctaatct 
aaccactgct 
ccagacacag 
tggcactgtc 
taaataggcc 
aagacgcgca 
gggcaggaag 
gttagagaga 
tgacgtagaa 
gactatcata 
tggaaaggac 
ggtggctctc 
tatgctatac 
gaacttcgga 
aggcagtctg 
ctctggcctc 
ggccccttcg 
ctcgcgtcgt 
gacagcaccg 
tgctccttcg 
ggcgggctca 
cttcaaaagc 
tgcagccaat 
ttgggtggag 
cgccgtgttc 
cggtgccctg 
cgttccttgc 
gggcgaagtg 
catcatggct 
ccaccaagcg 
tcaggatgat 
caaggcgcgc 
gaatatcatg 
ggcggaccgc 
cgaatgggct 
cgccttctat 
gaaattgatg 
actttgttaa 



gaatgggtaa
acctcaccgc 
actggggggc 
tcgtccatgc 
aggaggatat 
tcttcgtcga 
ctgcctacct 
ctcgcgagat 
acaacgccta 
ggttcttttc 
aggtgaaggg 
agagcttcgt 
ggcgcggatc 
cttgggggag 
ccttggggga 
gatccgtcga 
aactcacaca 
gacatggaca 
tgtggcttct 
ctggaagttg 
ttgtctgact 
gggcaagttg 
gcagtggcac 
cctcagcctc 
tttttttggt 
caggtgatct 
cccttccctg 
cataggctac 
ctctcatgcg 
ggccataact 
ggcaaaacgc 

agggcctatt

taattagaat 
agtaataatt 
tgcttaccgt 
gaaacaccgg 
gctgagttgg 
gaagttatgg 
ataggaactt 
gagcatgcgc 
gcacacattc 
cgccaccttc 
gcaggacgtg 
ctgagcaatg 
ctttctgggc 
ggggcggggc 
gcacgtctgc 
atgggatcgg 
aggctattcg 
cggctgtcag 
aatgaactgc 
gcagctgtgc 
ccggggcagg 
gatgcaatgc 
aaacatcgca 
ctggacgaag 
atgcccgacg 
gtggaaaatg 
tatcaggaca 
gaccgcttcc 
cgccttcttg 
atctattaaa 
gaagggtgag 



gtccggcaag 

ttggttcgag 

ttgtctggcc 

tgagagtgtc 

cgccctgatc 

gaccatgctc 

ggagccattc 

ccctctcgtt 

ccttcgggcc 

caacgctatt 

cctccacttc 

ggagcgcgtg 

aataaaagat 

ggggaggcca 

gggggaggcc 

cttaattaag 

acgatgacgc 

aggtcgagac 

agctgcccgg 

ccactccagt 

aggtgtcctt 

ggaagacaac 

aatcttggct 

ccgagttgtt 

agagacgggg 

acccaccttg 

tccttctgat 

ctggccatgc 

ttgggtccac 

tcgtataatg 

accacgtgac 

tcccatgatt 

taatttgact 

tcttgggtag 

aacttgaaag 

gattccaatt 

aatccatttt 

cgcgccggta 

cttaggtcaa 

tttagcagcc 

cacatccacc 

tactcctccc 

acaaatggaa 

gaagcgggta 

tcagaggctg 

gggcgcccga

cgcgctgttc 

ccattgaaca 

gctatgactg 

cgcaggggcg 

aggacgaggc 

tcgacgttgt 

atctcctgtc 

ggcggctgca 

tcgagcgagc 

agcatcaggg

gcgaggatct 

gccgcttttc 

tagcgttggc 

tcgtgcttta 

acgagttctt 

caataaagat 

aacagagtac 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 
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ctacattttg 
tgcctgctct 
atcataattt 
tatagatcta 
ctaagtactg 
tctgttccac 
ctgactctag 
ggaacttcaa 



aatggaagga 
ttactgaagg 
aaacaagcaa 
tagatctctc 
tggtttccaa 
atacacttca 
atcccgcgcc 
gcttaagcgc 



ttggagctac 
ctctttacta 
aaccaaatta 
gtgggatcat 
atgtgtcagt 
ttctcagtat 
gaagttccta 
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gggggtgggg 
ttgctttatg 
agggccagct
tgtttttctc 
ttcatagcct 
tgttttgcca 
tactttctag 



gtggggtggg 
ataatgtttc 
cattcctccc 
ttgattccca 
gaagaacgag 
agttctaatt 
agaataggaa 
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attagataaa 
atagttggat 
actcatgatc 
ctttgtggtt 
atcagcagcc 
ccatcagaag 
cttcggaata 



4260 
4320 
4380 
4440 
4500 
4560 
4620 
4640 



<210> 8 
<211> 15174 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Targeting
vector for Rosa26 locus with a Rluc-U6-shRNA neo
insert



<400> 8 
ctagggataa 
tgagacagcc 
cggtccgccc 
^gcggccatg 
gctgattggc 
cgtaaggcgc 
cccactgggt 
gcctctggcg 
ccgcccaccc 
gtctgattgg 
gttgagtcca 
ccctcggccc 
gcgctggggg 
tccgacttga 
gagtggaggg 
tcccaaagtc 
ggccgcaccc 
gctgcctcct
tcgtgatctg 
gtctagggcg 
tttccacagc 
actgactgag 
ccaccatggc
agtggtgggc 
ccgagaagca 
tgtggaggca 
tcggaatggg 
agtacctcac 
acgactgggg 
ccatcgtcca 
tcgaggagga 
acttcttcgt 
tcgctgccta 
ggcctcgcga 
actacaacgc 
ctgggttctt
tgaaggtgaa 
tcaagagctt 
ccaggcgcgg 
tgccttgggg 
tgaccttggg 
gcggatccgt 



cagggtaata 
gggtacgagt 
tgcagcaacc 
gctcgggggg 
ttcttttcct 
ctgtcagtta 
ggggcgggag 
gggcggggga 
tccccttcct 
ctctcggggc 
tccgccggcc 
cgcgccgcag 
cggggacggg 
gttgcctcaa 
aaggagcgag 
gctctgagtt 
ttctccggag 
ggcttctgag 
caactccagt 
cagtagtcca 
tcgcggttga 
tcgagatctg 
ttccaaggtg 
tcgctgcaag 
cgccgagaac 
cgtcgtgcct 
taagtccggc 
cgcttggttc 
ggcttgtctg 
tgctgagagt 
tatcgccctg 
cgagaccatg 
cctggagcca 
gatccctctc 
ctaccttcgg 
ttccaacgct 
gggcctccac 
cgtggagcgc 
atcaataaaa 
gagggggagg 
ggagggggag 
cgacttaatt 



tagccgcggc 

cgtgacgctg 

ggagggggag 

gggggggcag 

cccgccgtgt 

acggcagccg 

gtaggtgggg 

ggggagggag 

ctgggggagt 

ccagaaaact 

agcgggggcg 

agtctggccg 

cagtagggct 

gaggggcgtg 

ggctcagttg 

gttatcagta 

gggggagggg 

gaccgccctg 

ctttctaggt 

gggtttcctt 

ggacaaactc 

cgatctaagt 

tacgaccccg 

caaatgaacg 

gccgtgattt 

cacatcgagc 

aagagcggga 

gagctgctga

gcctttcact 

gtcgtggacg 

atcaagagcg 

ctcccaagca 

ttcaaggaga 

gttaagggag 

gccagcgacg 

attgtcgagg 

ttcagccagg 

gtgctgaaga 

gatcattatt 

ccagaatgag 

gccagaatga

aaggccaggg 



aggccctccg 
gaaggggcaa 
ggagaaggga 
cggaggagcg 
gtgaaaacac 
gagtgcgcag 
tgaggcgagc 
ggtcagcgaa 
cgttttaccc 
ggcccttgcc 
gcgaggaggc 
cgcgcccctg 
gagcggctgc 
ctgagccaga 
ggctgttttg 
agggagctgc 
agtgttgcaa 
ggcctgggag 
aaccgatatc 
gatgatgtca 
ttcgcggtct 
aagcttggca 
agcaacgcaa 
tgctggactc 
ttctgcatgg 
ccgtggctag 
atggctcata 
accttccaaa 
actcctacga 
tgatcgagtc 
aagagggcga 
agatcatgcg 
agggcgaggt 
gcaagcccga 
atctgcctaa 
gagctaagaa 
aggacgctcc 
acgagcagta 
ttcaatagat 
gcgcggccaa 
ccttggggga 
atcttcaagc 



agcgtggtgg 
gcgggtggtg 
gcggaaaagt 
cttccggccg 
aaatggcgtg 
ccgccggcag 
tggacgtgcg 
agtagctcgc 
gccgccggcc 
attggctcgt 
gctcccaggt 
cgcaacgtgg 
ggggcgggtg 
cctccatcgc 
gaggcaggaa 
agtggagtag 
tacctttctg 
aatcccttcc 
cctgcagggg 
tacttatcct 
ttccagtact 
ttccggtact 
acgcatgatc 
cttcatcaac 
taacgctgcc 
atgcatcatc 
tcgcctcctg 
gaaaatcatc 
gcaccaagac 
ctgggacgag 
gaaaatggtg 
gaaactggag 
tagacggcct 
cgtcgtccag 
gatgttcatc 
gttccctaac 
agatgaaatg 
attctagacc 
ctgtgtgttg 
gggggagggg 
gggggaggcc 
agacctacag 



agccgttctg 
ggcaggaatg 
ctccaccgga 
acgtctcgtc 
ttttggttgg 
cctcgctctg 
ggcgcggtcg 
gcgcgagcgg 
gggcctcgtc 
gttcgtgcaa 
tccggccctc 
caggaagcgc 
caagcacgtt 
gcactccggg 
gcacttgctc 
gcggggagaa 
ggagttctct 
ccctcttccc 
tgacctgcac 
gtcccttttt 
cctgcaggtg 
gttggtaaag 
actgggcctc 
tactatgatt 
tccagctacc 
cctgatctga 
gatcactaca 
tttgtgggcc 
aagatcaagg 
tggcctgaca 
cttgagaata 
cctgaggagt 
accctctcct 
attgtccgca 
gagtccgacc 
accgagttcg 
ggtaagtaca 
ggttcgagat 
gttttttgtg 
gaggccagaa 
agaatgaggc 
caagttcgac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 
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acaaactcac* 

aaggacatgg 

agctgtggct 

gccctggaag 

attttgtctg 

aaggggcaag 

agtgcagtgg 

ctgcctcagc 

ttgttttttt 

tctcaggtga 

gctcccttcc 

cagcataggc 

gtcctctcat 

gccggccata 

gcaggcaaaa 

aagagggcct 

agataattag 

gaaagtaata 

atatgcttac 

gacgaaacac 

ctcgctgagt 

tacgaagtta 

ggaataggaa 

ctggagcatg 

ctcgcacaca

tcgcgccacc 

cgtgcaggac 

ccgctgagca 

tcgctttctg 

tcaggggcgg 

agcgcacgtc 

aatatgggat 

gagaggctat 

ttccggctgt 

ctgaatgaac 

tgcgcagctg 

gtgccggggc 

gctgatgcaa 

gcgaaacatc 

gatctggacg 

cgcatgcccg 

atggtggaaa 

cgctatcagg 

gctgaccgct 

tatcgccttc 

atgatctatt 

taagaagggt 

ggggtggggt 

atgataatgt 

gctcattcct 

ctcttgattc 

cctgaagaac 

ccaagttcta 

tagagaatag 

tcttctgggc 

acaggtgtaa 

tttaataggg 

aaaactacag 

aaagacatgc 

ttacagtgtc 

acttcatatc 

cattttagcc 

cattttccct 



acaacgatga 

acaaggtcga 

tctagctgcc 

ttgccactcc 

actaggtgtc 

ttgggaagac 

cacaatcttg 

ctcccgagtt 

ggtagagacg 

tctacccacc 

ctgtccttct 

tacctggcca 

gcgttgggtc 

acttcgtata 

cgcaccacgt 

atttcccatg 

aattaatttg 

atttcttggg 

cgtaacttga

cgggattcca 

tggaatccat 

tggcgcgccg 

cttcttaggt

cgctttagca 

ttccacatcc 

ttctactcct

gtgacaaatg 

atggaagcgg 

ggctcagagg 

ggcgggcgcc 

tgccgcgctg

cggccattga 

tcggctatga 

cagcgcaggg 

tgcaggacga 

tgctcgacgt 

aggatctcct

tgcggcggct 

gcatcgagcg 

aagagcatca 

acggcgagga 

atggccgctt 

acatagcgtt 

tcctcgtgct 

ttgacgagtt 

aaacaataaa 

gagaacagag 

gggattagat 

ttcatagttg 

cccactcatg 

ccactttgtg 

gagatcagca 

attccatcag 

gaacttcgga 

aggcttaaag 

aattggaggg 

gcaaataagg 

gttattattg 

tcacccgagt 

gcgagttaga 

catttctccc 

ccattttcat 

ttcctgatct 



cgcactactc 

gacattcctg 

cgggtggcat 

agtgcccacc 

cttctataat 

aacctgtagg 

gctcactgca 

gttgggattc

gggtttcacc 

ttggcctccc 

gattttaaaa

tgcccaaccg 

cactcagtag 

atgtatgcta 

gacggagcgt 

attccttcat 

actgtaaaca 

tagtttgcag 

aagtatttcg 

attcagcggg 

ttttttctag 

gtaaccgaag 

caattctacc 

gccccgctgg 

accggtaggc 

cccctagtca 

gaagtagcac 

gtaggccttt 

ctgggaaggg 

cgaaggtcct 

ttctcctctt 

acaagatgga 

ctgggcacaa 

gcgcccggtt 

ggcagcgcgg 

tgtcactgaa 

gtcatctcac 

gcatacgctt 

agcacgtact 

ggggctcgcg 

tctcgtcgtg 

ttctggattc 

ggctacccgt 

ttacggtatc 

cttctgaggg 

gatgtccact 

tacctacatt 

aaatgcctgc 

gatatcataa 

atctatagat 

gttctaagta 

gcctctgttc 

aagctgactc 

ataggaactt 

gctaacctgg 

acaagacttc 

aaaatgggag 

cttgtgatcc 

tttatactct 

ctatgtaagc 

gctccttctg 

ttattatact 

tagaagtctg 
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aagaactacg 
cgcatcgtgc 
ccctgtgacc 
agccttgtcc 
attatggggt 
gcctgcgggg 
atctccgcct
caggcatgca 
atattggcca 
aaattgctgg 
taactatacc 
gtgggacatt 
atgcctgttg 
tacgaagtta 
gaccgcgcgc 
atttgcatat 
caaagatatt 
ttttaaaatt 
atttcttggc 
agccacctga 
actcgagata 
ttcctatact 
gggtagggga 
gcacttggcg 
gccaaccggc 
ggaagttccc 
gtctcactag 
ggggcagcgg 
gtgggtccgg 
ccggaggccc 
cctcatctcc 
ttgcacgcag 
cagacaatcg 
ctttttgtca 
ctatcgtggc 
gcgggaaggg 
cttgctcctg 
gatccggcta 
cggatggaag 
ccagccgaac 
acccatggcg 
atcgactgtg 
gatattgctg 
gccgctcccg 
gatcgatccg 
aaaatggaag 
ttgaatggaa 
tctttactga 
tttaaacaag 
ctatagatct 
ctgtggtttc 
cacatacact 
tagatcccgc 
caagcttaag 
tgtgtgggcg 
ccacagattt 
gataggtagt 
gcctcggagt 
cctgcttgag 
agaattttaa
cagccttatc 
ggcttatcca 
atgactcatg 






ggctgctcta 
agtgccgctc 
cctccccagt 
taataaaatt 
ggaggggggt 
tctattggga 
cctgggttca 
tgaccaggct 
ggctggtctc 
gattacaggc 
agcaggagga 
tgagttgctt 
aattaagctt 
tggatccagt 
cgagcccaag 
acgatacaag 
agtacaaaat 
atgttttaaa 
tttatatatc 
tgaagcttga 
acttcgtata 
ttctagagaa 
ggcgcttttc 
ctacacaagt 
tccgttcttt 
ccccgccccg 
tctcgtgcag 
ccaatagcag 
gggcgggctc 
ggcattctgc 
gggcctttcg 
gttctccggc 
gctgctctga 
agaccgacct 
tggccacgac 
actggctgct 
ccgagaaagt 
cctgcccatt 
ccggtcttgt 
tgttcgccag 
atgcctgctt 
gccggctggg 
aagagcttgg 
attcgcagcg 
ctgtaagtct 
tttttcctgt 
ggattggagc 
aggctcttta 
caaaaccaaa 
ctcgtgggat 
caaatgtgtc 
tcattctcag 
gccgaagttc 
cgctagaaga 
ttgtcctgca 
tcggttttgt 
catctggggt 
attttccatc 
atccttacta 
tcatttttaa 
aaaaggtatt 
acccctagac 
aaaccagaca 



ctgcttcagg 

tgtggagggc 

gcctctcctg 

aagttgcatc 

ggtatggagc 

accaagctgg 

agcgattctc 

cagctaattt 

caactcctaa 

gtgaaccact

cgtccagaca 

gcttggcact 

atttaaatag 

ggaaagacgc 

gtcgggcagg 

gctgttagag 

acgtgacgta 

atggactatc 

ttgtggaaag 

tcgggtggct 

atgtatgcta 

taggaacttc 

ccaaggcagt 

ggcctctggc 

ggtggcccct 

cagctcgcgt 

atggacagca 

ctttgctcct 

aggggcgggc 

acgcttcaaa 

acctgcagcc 

cgcttgggtg 

tgccgccgtg 

gtccggtgcc 

gggcgttcct 

attgggcgaa 

atccatcatg 

cgaccaccaa 

cgatcaggat 

gctcaaggcg 

gccgaatatc 

tgtggcggac 

cggcgaatgg

catcgccttc 

gcagaaattg 

catactttgt 

tacgggggtg 

ctattgcttt 

ttaagggcca 

cattgttttt 

agtttcatag 

tattgttttg 

ctatactttc 

tgggcgggag 

ggggaattga

cgggaagttt 

tttatgcagc 

gaggtagatt 

cagtatgaaa 

agagcccagt 

ttagaacact 

agagcattgg 

gattagttac 



2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240

3300 

3360

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 
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atacaccaca aatcgaggct gtagctgggg 
tagtacactt tttgttgatc ctttgccttg 
ccgtcagtgg tgttccacat ttgggcctat 
tgtattgaga atccaaccta aagcttaact
tctccattta taaactgagc tattaaccat 
tattacctga tgtatcttac atattgccag 
ttcattattg agccacatgg tattgattac 
ctgtaaaagg tggttccttt tggaatgcaa 
aaggtcttgt gagcttgtat tttttctatt 
ctcatggcat tctacacgtt attgctggtc 
cctataattt tgcttgactt gtagccaaac 
ttgtgaagct tggtttttag gttgttgtgt 
tccaggggct ggagaggtgg ctcggagttc 
gagttcaatt cccagcaacc acatggtggc 
ctcttctggt gtgtctgaag accacaagtg
ttcttctttt tttttttttt aaagagaata 
aaatactttg tgtttgttcc aatatggtag 
aaatgttacc aaggaactaa tttttatttg 
tgcagacttg tgggatacag aagaccaatg 
gcaataaaga aataaaaatt gaacttctag 
ttaacttttg tgcttcatct atacaaagct 
tgaaagcaag taatgataat tttggatttc 
gtggtggtgc ttgcctttat gcctttaatc 
ctctgagttt gagcccagcc tggtctacac 
cacacagaaa ccctgttggg gaggggggct 
ttccctaatg agccactatg gatgtggcta 
tattattttt tctgtctctg ctgttggttg 
gcctccttcc tgccatgtgg tctcttgttt 
ggctttttgc cttctaaggg cagctgctga 
.gggaaatctt tcaa^acacta aaattgtcct 
aataaacctc ataaaatagt tatgaggagt 
cctataaaag agctattaag gctttttgtc 
tctttagaac caagggtctt agagttttag 
attttctagt ttcaaatcca gagaatccaa 
ttctgacttt taatgttaat ttgcttactg 
cagggtctca ctatgtatct ctgcctgatc 
ctgcttttgt ctcctgaata ctaaggttaa 
cagattcttt tataggggac acactaaggg 
caagtgatga aaacttgaat tattatcacc 
cctgttagag catgcttaag ggatccctag 
ggcaggctcc tggtgagagc atatttcaaa 
taaggttacc tgtctttaaa ccatctgcat 
atatttcatt caagttttcc cccatcaaat 
gagttggaaa ctaaacaaat gttggttttg 
taaagcccat gagatacaga acaaagctgc 
agcacttctt tgggtttccc tgcactatcc 
ttgttgttca aataaactta agtttcctgt 
ggcaaaacat gttatatatg ttaaacattt 
gtttgatttt taattttcaa aacctgagct 
ggtggaattg tataattgtg gtttgcaggc 
agagcacttt gctgggtcac aagtctagga 
tttgttagtg tatactagtt tatatgttgg 
ctatttttga ctgggctaag gaattgattc 
cctttaggct tgaattggag tcacttgtat 
gcccttgttc tttatcaccc tgttttctac 
gatctccttt tgacaacaat agcatgtttt 
ttttttggtt tttcgagaca gggtttctqt 
tgtagaccag gctggcctcg aactcagaaa 
attaaaggcg tgcaccacca cgcctggcta 
aatactaact ccactgggtg gatttttaat 
gcccttcatt aaaatctact gttcactcta 
gcaacttcct acggatatac tagcagatta 
ttgtatacct accagcttta tactaccttg 
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cctcaacact gcagttcttt tataactcct 6360 
atccttaatt ttcagtgtct atcacctctc 6420 
tctcagtcca gggagtttta caacaataga 6480
ttccactccc atgaatgcct ctctcctttt 6540 
taatggttcc aggtggatgt ctcctcccca 6600 
gctgatattt taagacatta aaaggtatat 6660 
tgcttactaa aattttgtca ttgtacacat 6720 
agttcaggtg tttgttgtct ttcctgacct 6780
taagcagtgc tttctcttgg actggcttga 6840
taaatgtgat tttgccaagc ttcttcagga 6900 
acaagtaaaa tgattaagca acaaatgtat 6960 
tgtgtgtgct tgtgctctat aataatacta 7020
aagagcacag actgctcttc cagaagtcct 7080 
tcacaaccat ctgtaatggg atctgatgcc 7140 
tattcacatt aaataaataa atcctccttc 7200 
ctgtctccag tagaatttac tgaagtaatg 7260
ccaataatca aattactctt taagcactgg 7320 
aagtgtaact gtggacagag gagccataac 7380 
cagactttaa tgtcttttct cttacactaa 7440
tatcctattt gtttaaactg ctagctttac 7500 
gaaagctaag tctgcagcca ttactaaaca 7560 
aaaaatgtag ggccagagtt tagccagcca 7620 
ccagcactct ggaggcagag acaggcagat 7680
atcaagttct atctaggata gccaggaata 7740
ctgagatttc ataaaattat aattgaagca 7800
aatccgtcta cctttctgat gagatttggg 7860 
ggtcttttga cactgtgggc tttctttaaa 7920
gctactaact tcccatggct taaatggcat 7980 
gatttgcagc ctgatttcca gggtggggtt 8040 
ttaatttttt ttttaaaaaa tgggttatat 8100 
gaggtggact aatattaaat gagtccctcc 8160 
ttatacttaa cttttttttt aaatgtggta 8220
tatacagaaa ctgttgcatc gcttaatcag 8280 
attcttcaca gccaaagtca aattaagaat 8340 
tgaatataaa aatgatagct tttcctgagg 8400 
tgcaacaaga tatgtagact aaagttctgc 8460
aatgtagtaa tacttttgga acttgcaggt 8520 
agcttgggtg atagttggta aaatgtgttt 8580 
gcaacctact ttttaaaaaa aaaagccagg 8640 
gacttgctga gcacacaaga gtagttactt 8700 
aaacaaggca gacaaccaag aaactacagt 8760
atacacaggg atattaaaat attccaaata 8820 
tgggacatgg atttctccgg tgaataggca 8880
tgatttgtga aattgttttc aagtgatagt 8940 
tatttcgagg tctcttggtt tatactcaga 9000 
tgatcatgtg ctaggcctac cttaggctga 9060 
caggtgatgt catatgattt catatatcaa 9120 
gtacttaatg tgaaagttag gtctttgtgg 9180
aaataagtca tttttacatg tcttacattt 9240
aagactctct gacctagtaa ccctacctat 9300 
gtcaagcatt tcaccttgaa gttgagacgt 9360 
aggacatgtt tatccagaag atattcagga 9420 
tgattagcac tgttagtgag cattgagtgg 9480 
atctcaaata atgctggcct tttttaaaaa 9540 
ataatttttg ttcaaagaaa tacttgtttg 9600 
caagccatat tttttttcct tttttttttt 9660 
gtatagccct ggctgtcctg gaactcactt 9720 
tccgcctgcc tctgcctcct gagtgccggg 9780 
agttggatat tttgttatat aactataacc 9840
tcagtcagta gtcttaagtg gtctttattg 9900 
acagaggctg ttggtactag tggcacttaa 9960 
agggtcaggg ataga^acta gtctagcgtt 10020 
ttctgataga aatatttcag gacatctagc 10080 
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acgt.gttaac tcgagctgca ggattcgagg gccccggcag gtcaattcta ccgggtaggg 10140 • 
gaggcgcttt tcccaaggca gtctggagca tgcgctttag cagccccgct gggcacttgg 10200
cgctacacaa gtggcctctg gcctcgcaca cattccacat ccaccggtag gcgccaaccg 10260 
gctccgttct ttggtggccc cttcgcgcca ccttctactc ctcccctagt caggaagttc 10320 
ccccccgccc cgcagctcgc gtcgtgcagg acgtgacaaa tggaagtagc acgtctcact 10380 
agtctcgtgc agatggacag caccgctgag caatggaagc gggtaggcct ttggggcagc 10440 
ggccaatagc agctttgctc cttcgctttc tgggctcaga ggctgggaag gggtgggtcc 10500 
gggggcgggc tcaggggcgg gctcaggggc ggggcgggcg cccgaaggtc ctccggaggc 10560 
ccggcattct gcacgcttca aaagcgcacg tctgccgcgc tgttctcctc ttcctcatct 10620
ccgggccttt cgacctgcag ccaatgcacc gtccttgcca tcatggcctc gtaccccggc 10680
catcaacacg cgtctgcgtt cgaccaggct gcgcgttctc gcggccatag caaccgacgt 10740 
acggcgttgc gccctcgccg gcagcaagaa gccacggaag tccgcccgga gcagaaaatg 10800 
cccacgctac tgcgggttta tatagacggt ccccacggga tggggaaaac caccaccacg 10860
caactgctgg tggccctggg ttcgcgcgac gatatcgtct acgtacccga gccgatgact 10920 
tactggcggg tgctgggggc ttccgagaca atcgcgaaca tctacaccac acaacaccgc 10980 
ctcgaccagg gtgagatatc ggccggggac gcggcggtgg taatgacaag cgcccagata 11040
acaatgggca tgccttatgc cgtgaccgac gccgttctgg ctcctcatat cgggggggag 11100 
gctgggagct cacatgcccc gcccccggcc ctcaccctca tcttcgaccg ccatcccatc 11160 
gccgccctcc tgtgctaccc ggccgcgcgg taccttatgg gcagcatgac cccccaggcc 11220 
gtgctggcgt tcgtggccct catcccgccg accttgcccg gcaccaacat cgtgcttggg 11280 
gcccttccgg aggacagaca catcgaccgc ctggccaaac gccagcgccc cggcgagcgg 11340
ctggacctgg ctatgctggc tgcgattcgc cgcgtttacg ggctacttgc caatacggtg 11400 
cggtatctgc agtgcggcgg gtcgtggcgg gaggactggg gacagctttc ggggacggcc 11460 
gtgccgcccc agggtgccga gccccagagc aacgcgggcc cacgacccca tatcggggac 11520 
acgttattta ccctgtttcg ggcccccgag ttgctggccc ccaacggcga cctgtataac 11580 
gtgtttgcct gggccttgga cgtcttggcc aaacgcctcc gttccatgca cgtctttatc 11640 
ctggattacg accaatcgcc cgccggctgc cgggacgccc tgctgcaact tacctccggg 11700 
atggtccaga cccacgtcac cacccccggc tccataccga cgatatgcga cctggcgcgc 11760 
acgtttgccc gggagatggg ggaggctaac tgaggggatc gatccgtcct gtaagtctgc 11820
agaaattgat gatctattaa acaataaaga tgtccactaa aatggaagtt tttcctgtca 11880 
tactttgtta agaagggtga gaacagagta cctacatttt gaatggaagg attggagcta 11940
cgggggtggg ggtggggtgg gattagataa atgcctgctc tttactgaag gctctttact 12000
attgctttat gataatgttt catagttgga tatcataatt taaacaagca aaaccaaatt 12060 
aagggccagc tcattcctcc cactcatgat ctatagatct atagatctct cgtgggatca 12120 
ttgtttttct cttgattccc actttgtggt tctaagtact gtggtttcca aatgtgtcag 12180 
tttcatagcc tgaagaacga gatcagcagc ctctgttcca catacacttc attctcagta 12240 
ttgttttgcc aagttctaat tccatcagaa gctgactcta ggccgagctc caattcgccc 12300
tatagtgagt cgtattacaa ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac 12360
cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 12420 
agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 12480 
gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc 12540 
gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc 12600
acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt 12660
agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg 12720 
ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt 12780
ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta 12840 
taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt 12900
aacgcgaatt ttaacaaaat attaacgctt acaatttagg tggcactttt cggggaaatg 12960
tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 13020 
gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 13080 
atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 13140 
cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 13200 
tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 13260 
caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 13320
ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 13380 
cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 13440 
taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 13500
agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 13560
cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 13620 
caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 13680 
taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 13740 
ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 13800 
cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 13860
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aggcaactat 
attggtaact 
tttaatttaa 
aacgtgagtt 
gagatccttt 
cggtggtttg 
gcagagcgca 
agaactctgt 
ccagtggcga 
cgcagcggtc 
acaccgaact 
gaaaggcgga 
ttccaggggg 
agcgtcgatt 
cggccttttt 
tatcccctga 
gcagccgaac 
gcaaaccgcc 
ccgactggaa 
caccccaggc 
aacaatttca 
actaaaggga 



ggatgaacga 
gtcagaccaa 
aaggatctag 
ttcgttccac 
ttttctgcgc 
tttgccggat 
gataccaaat 
agcaccgcct 
taagtcgtgt 
gggctgaacg 
gagataccta 
caggtatccg 
aaacgcctgg 
tttgtgatgc 
acggttcctg 
ttctgtggat 
gaccgagcgc 
tctccccgcg 
agcgggcagt 
tttacacttt 
cacaggaaac 
acaaaagctg 



aatagacaga 
gtttactcat 
gtgaagatcc 
tgagcgtcag 
gtaatctgct 
caagagctac 
actgtccttc 
acatacctcg 
cttaccgggt 

gggggttcgt 

cagcgtgagc 
gtaagcggca 
tatctttata 
tcgtcagggg 
gccttttgct 
aaccgtatta 
agcgagtcag 
cgttggccga 
gagcgcaacg 
atgcttccgg 
agctatgacc 
tcgagatcta 




tcgctgagat 
atatacttta 
tttttgataa 
accccgtaga 
gcttgcaaac 
caactctttt 
tagtgtagcc 
ctctgctaat 
tggactcaag 
gcacacagcc 
tatgagaaag 
gggtcggaac 
gtcctgtcgg 
ggcggagcct 
ggccttttgc 
ccgcctttga 
tgagcgagga 
ttcattaatg 
caattaatgt 
ctcgtatgtt 
atgattacgc 
gatatcgatg 



aggtgcctca 
gattgattta 
tctcatgacc 
aaagatcaaa 
aaaaaaacca 
tccgaaggta 
gtagttaggc 
cctgttacca 
acgatagtta 
cagcttggag 
cgccacgctt 
aggagagcgc 
gtttcgccac 
atggaaaaac
tcacatgttc 
gtgagctgat 
agcggaagag 
cagctggcac 
gagttagctc 
gtgtggaatt 
caagcgcgca 
gccatagagt 



ctgattaagc 
aaacttcatt 
aaaatccctt 
ggatcttctt
ccgctaccag 
actggcttca 
caccacttca 
gtggctgctg 
ccggataagg 
cgaacgacct 
cccgaaggga 
acgagggagc 
ctctgacttg 
gccagcaacg 
tttcctgcgt 
accgctcgcc 
cgcccaatac 
gacaggtttc 
actcattagg 
gtgagcggat 
attaaccctc 
tacg 
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13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15174 



<210> 9 
<211> 4641 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Rluc tetO1
insert



<400> 9 

tctaggtaac 

tttccttgat 

caaactcttc 

tctaagtaag 

gaccccgagc 

atgaacgtgc 

gtgatttttc 

atcgagcccg 

agcgggaatg 

ctgctgaacc 

tttcactact

gtggacgtga 

aagagcgaag 

ccaagcaaga 

aaggagaagg 

aagggaggca 

agcgacgatc 

gtcgagggag 

agccaggagg 

ctgaagaacg 

cattattttc 

gaatgaggcg 

agaatgacct

gccagggatc 

actactcaag 

attcctgcgc

gtggcatccc 

gcccaccagc 

ctataatatt 



cgatatccct 
gatgtcatac 
gcggtctttc 
cttggcattc 
aacgcaaacg 
tggactcctt 
tgcatggtaa 
tggctagatg 
gctcatatcg 
ttccaaagaa 
cctacgagca 
tcgagtcctg 
agggcgagaa 
tcatgcggaa 
gcgaggttag 
agcccgacgt 
tgcctaagat 
ctaagaagtt 
acgctccaga 
agcagtaatt 
aatagatctg 
cggccaaggg 
tgggggaggg 
ttcaagcaga 
aactacgggc 
atcgtgcagt 
tgtgacccct 
cttgtcctaa 
atggggtgga 



gcaggggtga 
ttatcctgtc 
cagtactcct 
cggtactgtt 
catgatcact 
catcaactac 
cgctgcctcc 
catcatccct 
cctcctggat 
aatcatcttt 
ccaagacaag 
ggacgagtgg 
aatggtgctt 
actggagcct 
acggcctacc 
cgtccagatt 
gttcatcgag 
ccctaacacc 
tgaaatgggt 
ctagaccggt 
tgtgttggtt 
ggagggggag 
ggaggccaga
cctacagcaa 
tgctctactg 
gccgctctgt 
ccccagtgcc 
taaaattaag 
ggggggtggt 



cctgcacgtc 
cctttttttt 
gcaggtgact 
ggtaaagcca 
gggcctcagt 
tatgattccg 
agctacctgt 
gatctgatcg 
cactacaagt 
gtgggccacg 
atcaaggcca 
cctgacatcg 
gagaataact 
gaggagttcg 
ctctcctggc 
gtccgcaact 
tccgaccctg 
gagttcgtga 
aagtacatca 
tcgagatcca 
ttttgtgtgc 
gccagaatga 
atgaggcgcg 
gttcgacaca 
cttcaggaag 
ggagggcagc 
tctcctggcc 
ttgcatcatt 
atggagcaag 



tagggcgcag 
ccacagctcg
gactgagtcg 
ccatggcttc 
ggtgggctcg 
agaagcacgc 
ggaggcacgt 
gaatgggtaa 
acctcaccgc 
actggggggc 
tcgtccatgc 
aggaggatat 
tcttcgtcga 
ctgcctacct 
ctcgcgagat 
acaacgccta 
ggttcttttc 
aggtgaaggg 
agagcttcgt 
ggcgcggatc 
cttgggggag 
ccttggggga 
gatccgtcga 
aactcacaca 
gacatggaca 
tgtggcttct 
ctggaagttg 
ttgtctgact 
gggcaagttg 



tagtccaggg 

cggttgagga 

agatctgcga 

caaggtgtac 

ctgcaagcaa 

cgagaacgcc 

cgtgcctcac 

gtccggcaag 

ttggttcgag 

ttgtctggcc 

tgagagtgtc 

cgccctgatc 

gaccatgctc 

ggagccattc 

ccctctcgtt 

ccttcgggcc 

caacgctatt 

cctccacttc 

ggagcgcgtg 

aataaaagat 

ggggaggcca 

gggggaggcc 

cttaattaag 

acgatgacgc 

aggtcgagac 

agctgcccgg 

ccactccagt 

aggtgtcctt

ggaagacaac 



60 

120 

180 

240 

300 

360 

420 

480 

540

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 
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ctgtagggcc 

cactgcaatc 

gggattccag 

tttcaccata 

gcctcccaaa 

tttaaaataa 

ccaaccggtg 

tcagtagatg 

tatgctatac 

ggagcgtgac 

ccttcatatt

gtaaacacaa 

tttgcagttt 

tactctatca 

tcagcgggag 

ttttctagac 

aaccgaagtt 

attctaccgg 

cccgctgggc 

cggtaggcgc 

cctagtcagg 

agtagcacgt 

aggcctttgg 

gggaaggggt 

aaggtcctcc 

ctcctcttcc 

aagatggatt 

gggcacaaca 

gcccggttct 

cagcgcggct 

tcactgaagc 

catctcacct 

atacgcttga 

cacgtactcg 

ggctcgcgcc 

tcgtcgtgac 

ctggattcat 

ctacccgtga 

acggtatcgc 

tctgagggga 

tgtccactaa

cctacatttt 

atgcctgctc 

tatcataatt 

ctatagatct 

tctaagtact 

ctctgttcca 

gctgactcta 

aggaacttca 



tgcggggtct 
tccgcctcct 
gcatgcatga 
ttggccaggc 
ttgctgggat 
ctataccagc 
ggacatttga 
cctgttgaat 
gaagttatgg 
cgcgcgccga 
tgcatatacg 
agatattagt 
taaaattatg 
ttgatagagt 
ccacctgatg 
tcgagataac 
cctatacttt 
gtaggggagg 
acttggcgct 
caaccggctc 
aagttccccc 
ctcactagtc 
ggcagcggcc 
gggtccgggg 
ggaggcccgg 
tcatctccgg 
gcacgcaggt 
gacaatcggc 
ttttgtcaag 
atcgtggctg 
gggaagggac
tgctcctgcc 
tccggctacc 
gatggaagcc 
agccgaactg 
ccatggcgat 
cgactgtggc 
tattgctgaa 
cgctcccgat
tcgatccgct 
aatggaagtt 
gaatggaagg 
tttactgaag 
taaacaagca 
atagatctct 
gtggtttcca 
catacacttc 
gatcccgcgc 
agcttaagcg 



attgggaacc 
gggttcaagc 
ccaggctcag 
tggtctccaa 
tacaggcgtg 
aggaggacgt 
gttgcttgct 
taagcttatt 
atccagtgga 
gcccaaggtc 
atacaaggct 
acaaaatacg 
ttttaaaatg 
tatatatctt 
aagcttgatc 
ttcgtataat 
ctagagaata 
cgcttttccc 
acacaagtgg 
cgttctttgg 
ccgccccgca 
tcgtgcagat 
aatagcagct 
gcgggctcag 
cattctgcac 
gcctttcgac 
tctccggccg 
tgctctgatg 
accgacctgt 
gccacgacgg 
tggctgctat 
gagaaagtat 
tgcccattcg 
ggtcttgtcg 
ttcgccaggc 
gcctgcttgc 
cggctgggtg 
gagcttggcg 
tcgcagcgca 
gtaagtctgc 
tttcctgtca 
attggagcta 
gctctttact 
aaaccaaatt 
cgtgggatca 
aatgtgtcag 
attctcagta 
cgaagttcct 




aagctggagt 
gattctcctg 

ctaatttttg 
ctcctaatct 
aaccactgct 
ccagacacag 
tggcactgtc 
taaataggcc 
aagacgcgca 
gggcaggaag 
gttagagaga 
tgacgtagaa 
gactatcata 
gtggaaagga 

gggtggctct 

gtatgctata 

ggaacttcgg 

aaggcagtct 

cctctggcct 

tggccccttc 

gctcgcgtcg 

ggacagcacc 

ttgctccttc 

gggcgggctc 

gcttcaaaag 

ctgcagccaa 

cttgggtgga 

ccgccgtgtt 

ccggtgccct 

gcgttccttg 

tgggcgaagt 

ccatcatggc 

accaccaagc 

atcaggatga 

tcaaggcgcg 

cgaatatcat 

tggcggaccg 

gcgaatgggc 

tcgccttcta 

agaaattgat 

tactttgtta 

cgggggtggg 

attgctttat 

aagggccagc 

ttgtttttct 

tttcatagcc 

ttgttttgcc 

atactttcta 
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gcagtggcac 
cctcagcctc 
tttttttggt 
caggtgatct 
cccttccctg 
cataggctac 
ctctcatgcg 
ggccataact 
ggcaaaacgc 
agggcctatt 
taattagaat 
agtaataatt 
tgcttaccgt 
cgaaacaccg 
cgctgagttg 
cgaagttatg 
aataggaact 
ggagcatgcg 
cgcacacatt 
gcgccacctt 
tgcaggacgt 
gctgagcaat 
gctttctggg 
aggggcgggg 
cgcacgtctg 
tatgggatcg 
gaggctattc 
ccggctgtca 
gaatgaactg 
cgcagctgtg
gccggggcag 
tgatgcaatg 
gaaacatcgc 
tctggacgaa 
catgcccgac 
ggtggaaaat 
ctatcaggac 
tgaccgcttc 
tcgccttctt 
gatctattaa
agaagggtga 
ggtggggtgg 
gataatgttt 
tcattcctcc 
cttgattccc 
tgaagaacga 
aagttctaat 
gagaatagga 



aatcttggct 
ccgagttgtt 
agagacgggg 
acccaccttg 
tccttctgat 
ctggccatgc 
ttgggtccac 
tcgtataatg 
accacgtgac 
tcccatgatt 
taatttgact 
tcttgggtag 
aacttgaaag 
ggattccaat 
gaatccattt 
gcgcgccggt 
tcttaggtca 
ctttagcagc 
ccacatccac 
ctactcctcc 
gacaaatgga 
ggaagcgggt 
ctcagaggct 
cgggcgcccg 
ccgcgctgtt 
gccattgaac 
ggctatgact
gcgcaggggc 
caggacgagg 
ctcgacgttg 
gatctcctgt 
cggcggctgc 
atcgagcgag 
gagcatcagg 
ggcgaggatc 
ggccgctttt 
atagcgttgg 
ctcgtgcttt 
gacgagttct 
acaataaaga 
gaacagagta 
gattagataa 
catagttgga 
cactcatgat 
actttgtggt 
gatcagcagc 
tccatcagaa 
acttcggaat 



1800 
1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540

3600 

3660 

3720 

3780

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440

4500 

4560 

4620 

4641 



<210> 10 
<211> 4640 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Rluc tetO2
insert 

<400> 10

tctaggtaac cgatatccct gcaggggtga cctgcacgtc tagggcgcag tagtccaggg 60 
tttccttgat gatgtcatac ttatcctgtc cctttttttt ccacagctcg cggttgagga 120 
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caaactcttc 

tctaagtaag 

gaccccgagc 

atgaacgtgc 

gtgatttttc 

atcgagcccg

agcgggaatg 

ctgctgaacc 

tttcactact 

gtggacgtga 

aagagcgaag 

ccaagcaaga 

aaggagaagg 

aagggaggca 

agcgacgatc 

gtcgagggag 

agccaggagg 

ctgaagaacg 

cattattttc

gaatgaggcg 

agaatgacct 

gccagggatc 

actactcaag 

attcctgcgc 

gtggcatccc

gcccaccagc 

ctataatatt 

ctgtagggcc 

cactgcaatc 

gggattccag 

tttcaccata 

gcctcccaaa 

tttaaaataa 

ccaaccggtg 

tcagtagatg 

tatgctatac 

ggagcgtgac 

ccttcatatt 

gtaaacacaa 

tttgcagttt 

tatttcgatt 

cagcgggagc 

tttctagact 

accgaagttc 

ttctaccggg 

ccgctgggca 

ggtaggcgcc 

ctagtcagga 

gtagcacgtc 

ggcctttggg 

ggaaggggtg 

aggtcctccg 

tcctcttcct 

agatggattg 

ggcacaacag 

cccggttctt 

agcgcggcta 

cactgaagcg

atctcacctt 

tacgcttgat 

acgtactcgg 

gctcgcgcca 

cgtcgtgacc 



gcggtctttc 

cttggcattc

aacgcaaacg 

tggactcctt 

tgcatggtaa 

tggctagatg 

gctcatatcg 

ttccaaagaa 

cctacgagca

tcgagtcctg 

agggcgagaa 

tcatgcggaa 

gcgaggttag 

agcccgacgt 

tgcctaagat 

ctaagaagtt 

acgctccaga 

agcagtaatt 

aatagatctg 

cggccaaggg 

tgggggaggg 

ttcaagcaga 

aactacgggc 

atcgtgcagt 

tgtgacccct 

cttgtcctaa 

atggggtgga 

tgcggggtct 

tccgcctcct 

gcatgcatga 

ttggccaggc 

ttgctgggat 

ctataccagc 

ggacatttga 

cctgttgaat 

gaagttatgg 

cgcgcgccga 

tgcatatacg 

agatattagt 

taaaattatg

tcttggcttt 

cacctgatga 

cgagataact 

ctatactttc 

taggggaggc 

cttggcgcta 

aaccggctcc 

agttcccccc 

tcactagtct 

gcagcggcca 

ggtccggggg 

gaggcccggc 

catctccggg 

cacgcaggtt 

acaatcggct 

tttgtcaaga 

tcgtggctgg 

ggaagggact 

gctcctgccg 

ccggctacct 

atggaagccg

gccgaactgt 

catggcgatg 



cagtactcct 
cggtactgtt 
catgatcact 
catcaactac 
cgctgcctcc 
catcatccct 
cctcctggat 
aatcatcttt 
ccaagacaag 
ggacgagtgg 
aatggtgctt 
actggagcct 
acggcctacc 
cgtccagatt 
gttcatcgag 
ccctaacacc 
tgaaatgggt 
ctagaccggt 
tgtgttggtt 
ggagggggag 
ggaggccaga 
cctacagcaa 
tgctctactg 
gccgctctgt 
ccccagtgcc 
taaaattaag 

ggggggtggt 
attgggaacc 
gggttcaagc 
ccaggctcag 
tggtctccaa 
tacaggcgtg 
aggaggacgt 
gttgcttgct 
taagcttatt 
atccagtgga 
gcccaaggtc 
atacaaggct 
acaaaatacg 
ttttaaaatg 
atatatctcc 
agcttgatcg 
tcgtataatg 
tagagaatag 
gcttttccca 
cacaagtggc 
gttctttggt 
cgccccgcag 
cgtgcagatg 
atagcagctt 
cgggctcagg 
attctgcacg 
cctttcgacc 
ctccggccgc 
gctctgatgc 
ccgacctgtc 
ccacgacggg 
ggctgctatt 
agaaagtatc 
gcccattcga 
gtcttgtcga 
tcgccaggct 
cctgcttgcc 
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gcaggtgact 
ggtaaagcca 

gggcctcagt 
tatgattccg 
agctacctgt 
gatctgatcg
cactacaagt 
gtgggccacg 
atcaaggcca 
cctgacatcg 
gagaataact 
gaggagttcg 
ctctcctggc 
gtccgcaact 
tccgaccctg 
gagttcgtga 
aagtacatca 
tcgagatcca 
ttttgtgtgc 
gccagaatga 
atgaggcgcg 
gttcgacaca 
cttcaggaag 
ggagggcagc 
tctcctggcc 
ttgcatcatt 
atggagcaag 
aagctggagt 
gattctcctg 
ctaatttttg 
ctcctaatct 
aaccactgct 
ccagacacag 
tggcactgtc 
taaataggcc 
aagacgcgca 
gggcaggaag 
gttagagaga 
tgacgtagaa 
gactatcata 
ctatcagtga 
ggtggctctc 
tatgctatac 
gaacttcgga 
aggcagtctg 
ctctggcctc 
ggccccttcg 
ctcgcgtcgt 
gacagcaccg 
tgctccttcg 
ggcgggctca
cttcaaaagc 
tgcagccaat 
ttgggtggag 
cgccgtgttc 
cggtgccctg 
cgttccttgc 
gggcgaagtg 
catcatggct 
ccaccaagcg 
tcaggatgat 
caaggcgcgc 
gaatatcatg 
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gactgagtcg 
ccatggcttc 
ggtgggctcg 
agaagcacgc 
ggaggcacgt 
gaatgggtaa 
acctcaccgc 
actggggggc 
tcgtccatgc 
aggaggatat 
tcttcgtcga 
ctgcctacct 
ctcgcgagat 
acaacgccta 
ggttcttttc 
aggtgaaggg 
agagcttcgt 
ggcgcggatc 
cttgggggag 
ccttggggga 
gatccgtcga 
aactcacaca 
gacatggaca 
tgtggcttct 
ctggaagttg 
ttgtctgact 
gggcaagttg 
gcagtggcac 
cctcagcctc 
tttttttggt 
caggtgatct 
cccttccctg 
cataggctac 
ctctcatgcg 
ggccataact 
ggcaaaacgc 
agggcctatt 
taattagaat 
agtaataatt 
tgcttaccgt 
tagagaaagg 
gctgagttgg 
gaagttatgg 
ataggaactt 
gagcatgcgc 
gcacacattc 
cgccaccttc 
gcaggacgtg 
ctgagcaatg 
ctttctgggc 
ggggcggggc 
gcacgtctgc 
atgggatcgg 
aggctattcg 
cggctgtcag 
aatgaactgc 
gcagctgtgc 
ccggggcagg 
gatgcaatgc 
aaacatcgca 
ctggacgaag 
atgcccgacg 
gtggaaaatg 



agatctgcga 
caaggtgtac 
ctgcaagcaa 
cgagaacgcc 
cgtgcctcac 
gtccggcaag 
ttggttcgag 
ttgtctggcc 
tgagagtgtc 
cgccctgatc 
gaccatgctc 
ggagccattc 
ccctctcgtt 
ccttcgggcc 
caacgctatt 
cctccacttc 
ggagcgcgtg 
aataaaagat 
ggggaggcca 
gggggaggcc 
cttaattaag 
acgatgacgc 
aggtcgagac 
agctgcccgg 
ccactccagt 
aggtgtcctt 
ggaagacaac 
aatcttggct 
ccgagttgtt 
agagacgggg 
acccaccttg 
tccttctgat 
ctggccatgc 
ttgggtccac 
tcgtataatg 
accacgtgac 
tcccatgatt 
taatttgact 
tcttgggtag 
aacttgaaag 
gattccaatt 
aatccatttt 
cgcgccggta 
cttaggtcaa
tttagcagcc 
cacatccacc 
tactcctccc 
acaaatggaa 
gaagcgggta 
tcagaggctg 
gggcgcccga 
cgcgctgttc 
ccattgaaca 
gctatgactg 
cgcaggggcg 
aggacgaggc 
tcgacgttgt 
atctcctgtc 
ggcggctgca 
tcgagcgagc 
agcatcaggg 
gcgaggatct 
gccgcttttc 



180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540
3600 
3660 
3720
3780
3840 
3900

tggattcatc gactgtggcc ggctgggtgt ggcggaccgc tatcaggaca tagcgttggc 3960
tacccgtgat attgctgaag agcttggcgg cgaatgggct gaccgcttcc tcgtgcttta 4020
cggtatcgcc gctcccgatt cgcagcgcat cgccttctat cgccttcttg acgagttctt 4080
ctgaggggat cgatccgctg taagtctgca gaaattgatg atctattaaa caataaagat 4140
gtccactaaa atggaagttt ttcctgtcat actttgttaa gaagggtgag aacagagtac 4200
ctacattttg aatggaagga ttggagctac gggggtgggg gtggggtggg attagataaa 4260
tgcctgctct ttactgaagg ctctttacta ttgctttatg ataatgtttc atagttggat 4320
atcataattt aaacaagcaa aaccaaatta agggccagct cattcctccc actcatgatc 4380
tatagatcta tagatctctc gtgggatcat tgtttttctc ttgattccca ctttgtggtt 4440
ctaagtactg tggtttccaa atgtgtcagt ttcatagcct gaagaacgag atcagcagcc 4500
tctgttccac atacacttca ttctcagtat tgttttgcca agttctaatt ccatcagaag 4560
ctgactctag atcccgcgcc gaagttccta tactttctag agaataggaa cttcggaata 4620
ggaacttcaa gcttaagcgc 4640



<210> 11

<211> 3387

<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: H1 tet5' shRNA
vector 



<400> 11 

agctcgatat cggccggcca taacttcgta taatgtatgc tatacgaagt tatggatcct 60
cacagtaggt ggcatcgttc ctttctgact gcccgccccc cgcatgccgt cccgcgatat 120
tgagctccga acctctcgcc ctgccgccgc cggtgctccg tcgccgccgc gccgccatgg 180
aattcgaacg ctgacgtcat caacccgctc caaggaatcg cgggcccagt gtcactaggc 240
gggaacaccc agcgcgcgtg cgccctggca ggaagatggc tgtgagggac aggggagtgg 300
cgccctgcaa tatttgcatg tcgctatgtg ttctgggaaa tcaccataaa cgtgaaattc 360
cctatcagtg atagagatta taagttctgt atgagaccac tctttcccag gattccaatt 420
cagcgggagc cacctgatga agcttgatcg ggtggctctc gctgagttgg aatccatttt 480
tttctaggat aacttcgtat aatgtatgct atacgaagtt atggcgcgcc ggtacccagc 540
ttttgttccc tttagtgagg gttaatttcg agcttggcgt aatcatggtc atagctgttt 600
cctgtgtgaa attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag 660
tgtaaagcct ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg 720
cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg 780
gggagaggcg gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc 840
tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc 900
acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg 960
aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat 1020
cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag 1080
gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga 1140
tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg 1200
tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt 1260
cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac 1320
gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc 1380
ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt 1440
ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc 1500
ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc 1560
agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg 1620
aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag 1680
atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg 1740
tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt 1800
tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca 1860
tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca 1920
gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc 1980
tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt 2040
ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg 2100
gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc 2160
aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg 2220
ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga 2280
tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga 2340
ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta 2400
aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg 2460
ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact 2520
ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata 2580
agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt 2640
tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 2700
ataggggttc cgcgcacatt tccccgaaaa gtgccaccta aattgtaagc gttaatattt 2760
tgttaaaatt cgcgttaaat ttttgttaaa tcagctcatt ttttaaccaa taggccgaaa 2820
tcggcaaaat cccttataaa tcaaaagaat agaccgagat agggttgagt gttgttccag 2880
tttggaacaa gagtccacta ttaaagaacg tggactccaa cgtcaaaggg cgaaaaaccg 2940
tctatcaggg cgatggccca ctacgtgaac catcacccta atcaagtttt ttggggtcga 3000
ggtgccgtaa agcactaaat cggaacccta aagggagccc ccgatttaga gcttgacggg 3060
gaaagccggc gaacgtggcg agaaaggaag ggaagaaagc gaaaggagcg ggcgctaggg 3120
cgctggcaag tgtagcggtc acgctgcgcg taaccaccac acccgccgcg cttaatgcgc 3180
cgctacaggg cgcgtcccat tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg 3240
tgcgggcctc ttcgctatta cgccagctgg cgaaaggggg atgtgctgca aggcgattaa 3300
gttgggtaac gccagggttt tcccagtcac gacgttgtaa aacgacggcc agtgaattgt 3360
aatacgactc actatagggc gaattgg 3387 

<210> 12

<211> 3387

<212> DNA

<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: H1 tet3' shRNA
vector 

<400> 12

agctcgatat cggccggcca taacttcgta taatgtatgc tatacgaagt tatggatcct 60 
cacagtaggt ggcatcgttc ctttctgact gcccgccccc cgcatgccgt cccgcgatat 120 
tgagctccga acctctcgcc ctgccgccgc cggtgctccg tcgccgccgc gccgccatgg 180 
aattcgaacg ctgacgtcat caacccgctc caaggaatcg cgggcccagt gtcactaggc 240 
gggaacaccc agcgcgcgtg cgccctggca ggaagatggc tgtgagggac aggggagtgg 300 
cgccctgcaa tatttgcatg tcgctatgtg ttctgggaaa tcaccataaa cgtgaaatgt 360 
ctttggattt gggaatctta taagtcccta tcagtgatag agattcccag gattccaatt 420 
cagcgggagc cacctgatga agcttgatcg ggtggctctc gctgagttgg aatccatttt 480
tttctaggat aacttcgtat aatgtatgct atacgaagtt atggcgcgcc ggtacccagc 540 
ttttgttccc tttagtgagg gttaatttcg agcttggcgt aatcatggtc atagctgttt 600 
cctgtgtgaa attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag 660 
tgtaaagcct ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg 720 
cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg 780 
gggagaggcg gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc 840 
tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc 900 
acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg 960 
aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat 1020 
cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag 1080
gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga 1140 
tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg 1200 
tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt 1260
cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac 1320 
gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc 1380 
ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt 1440
ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc 1500 
ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc 1560
agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg 1620 
aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag 1680
atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg 1740 
tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt 1800
tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca 1860
tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca 1920
gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc 1980

tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt 2040 

ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg 2100 

gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc 2160 

aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg 2220 

ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga 2280 

tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga 2340 

ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta 2400 

aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg 2460

ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact 2520 

ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata 2580 

agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt 2640 

tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 2700 

ataggggttc cgcgcacatt tccccgaaaa gtgccaccta aattgtaagc gttaatattt 2760 

tgttaaaatt cgcgttaaat ttttgttaaa tcagctcatt ttttaaccaa taggccgaaa 2820 

tcggcaaaat cccttataaa tcaaaagaat agaccgagat agggttgagt gttgttccag 2880

tttggaacaa gagtccacta ttaaagaacg tggactccaa cgtcaaaggg cgaaaaaccg 2940 

tctatcaggg cgatggccca ctacgtgaac catcacccta atcaagtttt ttggggtcga 3000 

ggtgccgtaa agcactaaat cggaacccta aagggagccc ccgatttaga gcttgacggg 3060 

gaaagccggc gaacgtggcg agaaaggaag ggaagaaagc gaaaggagcg ggcgctaggg 3120 

cgctggcaag tgtagcggtc acgctgcgcg taaccaccac acccgccgcg cttaatgcgc 3180 

cgctacaggg cgcgtcccat tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg 3240 

tgcgggcctc ttcgctatta cgccagctgg cgaaaggggg atgtgctgca aggcgattaa 3300 

gttgggtaac gccagggttt tcccagtcac gacgttgtaa aacgacggcc agtgaattgt 3360 

aatacgactc actatagggc gaattgg 3387 

<210> 13

<211> 3387

<212> DNA
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: H1 tet5' +3'
shRNA vector 

<400> 13

agctcgatat cggccggcca taacttcgta taatgtatgc tatacgaagt tatggatcct 60 

cacagtaggt ggcatcgttc ctttctgact gcccgccccc cgcatgccgt cccgcgatat 120

tgagctccga acctctcgcc ctgccgccgc cggtgctccg tcgccgccgc gccgccatgg 180

aattcgaacg ctgacgtcat caacccgctc caaggaatcg cgggcccagt gtcactaggc 240 

gggaacaccc agcgcgcgtg cgccctggca ggaagatggc tgtgagggac aggggagtgg 300 

cgccctgcaa tatttgcatg tcgctatgtg ttctgggaaa tcaccataaa cgtgaaattc 360

cctatcagtg atagagatta taagtcccta tcagtgatag agattcccag gattccaatt 420 

cagcgggagc cacctgatga agcttgatcg ggtggctctc gctgagttgg aatccatttt 480 

tttctaggat aacttcgtat aatgtatgct atacgaagtt atggcgcgcc ggtacccagc 540 

ttttgttccc tttagtgagg gttaatttcg agcttggcgt aatcatggtc atagctgttt 600

cctgtgtgaa attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag 660

tgtaaagcct ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg 720 

cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg 780

gggagaggcg gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc 840 

tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc 900

acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg 960 

aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat 1020 

cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag 1080 

gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga 1140 

tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg 1200

tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt 1260 

cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac 1320 

gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc 1380

ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt 1440 

ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc 1500 

ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc 1560

agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg 1620

aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag 1680

atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg 1740

tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt 1800

tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca 1860

tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca 1920

gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc 1980

tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt 2040

ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg 2100

gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc 2160

aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg 2220

ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga 2280

tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga 2340

ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta 2400

aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg 2460

ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact 2520

ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata 2580

agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt 2640

tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 2700

ataggggttc cgcgcacatt tccccgaaaa gtgccaccta aattgtaagc gttaatattt 2760

tgttaaaatt cgcgttaaat ttttgttaaa tcagctcatt ttttaaccaa taggccgaaa 2820

tcggcaaaat cccttataaa tcaaaagaat agaccgagat agggttgagt gttgttccag 2880

tttggaacaa gagtccacta ttaaagaacg tggactccaa cgtcaaaggg cgaaaaaccg 2940

tctatcaggg cgatggccca ctacgtgaac catcacccta atcaagtttt ttggggtcga 3000

ggtgccgtaa agcactaaat cggaacccta aagggagccc ccgatttaga gcttgacggg 3060

gaaagccggc gaacgtggcg agaaaggaag ggaagaaagc gaaaggagcg ggcgctaggg 3120

cgctggcaag tgtagcggtc acgctgcgcg taaccaccac acccgccgcg cttaatgcgc 3180

cgctacaggg cgcgtcccat tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg 3240

tgcgggcctc ttcgctatta cgccagctgg cgaaaggggg atgtgctgca aggcgattaa 3300

gttgggtaac gccagggttt tcccagtcac gacgttgtaa aacgacggcc agtgaattgt 3360

aatacgactc actatagggc gaattgg 3387



<210> 14 
<211> 7209 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:
Fluc-GAGGS-tetR-hygro insert 



<400> 14 

tctaggtaac cgatatccct gcaggggtga cctgcacgtc tagggcgcag tagtccaggg 60
tttccttgat gatgtcatac ttatcctgtc cctttttttt ccacagctcg cggttgagga 120
caaactcttc gcggtctttc cagtactcct gcaggtgact gactgagtcg agatctgcga 180
tctaagtaag cttggcattc cggtactgtt ggtaaagcca ccatggaaga cgccaaaaac 240
ataaagaaag gcccggcgcc attctatccg ctggaagatg gaaccgctgg agagcaactg 300
cataaggcta tgaagagata cgccctggtt cctggaacaa ttgcttttac agatgcacat 360
atcgaggtgg acatcactta cgctgagtac ttcgaaatgt ccgttcggtt ggcagaagct 420
atgaaacgat atgggctgaa tacaaatcac agaatcgtcg tatgcagtga aaactctctt 480
caattcttta tgccggtgtt gggcgcgtta tttatcggag ttgcagttgc gcccgcgaac 540
gacatttata atgaacgtga attgctcaac agtatgggca tttcgcagcc taccgtggtg 600
ttcgtttcca aaaaggggtt gcaaaaaatt ttgaacgtgc aaaaaaagct cccaatcatc 660
caaaaaatta ttatcatgga ttctaaaacg gattaccagg gatttcagtc gatgtacacg 720
ttcgtcacat ctcatctacc tcccggtttt aatgaatacg attttgtgcc agagtccttc 780
gatagggaca agacaattgc actgatcatg aactcctctg gatctactgg tctgcctaaa 840
ggtgtcgctc tgcctcatag aactgcctgc gtgagattct cgcatgccag agatcctatt 900
tttggcaatc aaatcattcc ggatactgcg attttaagtg ttgttccatt ccatcacggt 960
tttggaatgt ttactacact cggatatttg atatgtggat ttcgagtcgt cttaatgtat 1020
agatttgaag aagagctgtt tctgaggagc cttcaggatt acaagattca aagtgcgctg 1080
ctggtgccaa ccctattctc cttcttcgcc aaaagcactc tgattgacaa atacgattta 1140
tctaatttac acgaaattgc ttctggtggc gctcccctct ctaaggaagt cggggaagcg 1200
gttgccaaga ggttccatct gccaggtatc aggcaaggat atgggctcac tgagactaca 1260
tcagctattc tgattacacc cgagggggat gataaaccgg gcgcggtcgg taaagttgtt 1320
ccattttttg aagcgaaggt tgtggatctg gataccggga aaacgctggg cgttaatcaa 1380
agaggcgaac tgtgtgtgag aggtcctatg attatgtccg gttatgtaaa caatccggaa 1440
gcgaccaacg ccttgattga caaggatgga tggctacatt ctggagacat agcttactgg 1500
gacgaagacg aacacttctt catcgttgac cgcctgaagt ctctgattaa gtacaaaggc 1560
tatcaggtgg ctcccgctga attggaatcc atcttgctcc aacaccccaa catcttcgac 1620
gcaggtgtcg caggtcttcc cgacgatgac gccggtgaac ttcccgccgc cgttgttgtt 1680
ttggagcacg gaaagacgat gacggaaaaa gagatcgtgg attacgtcgc cagtcaagta 1740
acaaccgcga aaaagttgcg cggaggagtt gtgtttgtgg acgaagtacc gaaaggtctt 1800
accggaaaac tcgacgcaag aaaaatcaga gagatcctca taaaggccaa gaagggcgga 1860
aagatcgccg tgtaattcta gaccggttcg agatccaggc gcggatcaat aaaagatcat 1920
tattttcaat agatctgtgt gttggttttt tgtgtgcctt gggggagggg gaggccagaa 1980
tgaggcgcgg ccaaggggga gggggaggcc agaatgacct tgggggaggg ggaggccaga 2040
atgaccttgg gggaggggga ggccagaatg aggcgcgccc ccgatccgtc gacgcccggg 2100
ctagcttgca tgcctgcagg ttttcgacat tgattattga ctagttatta atagtaatca 2160
attacggggt cattagttca tagcccatat atggagttcc gcgttacata acttacggta 2220
aatggcccgc ctggctgacc gcccaacgac ccccgcccat tgacgtcaat aatgacgtat 2280
gttcccatag taacgccaat agggactttc cattgacgtc aatgggtgga ctatttacgg 2340
taaactgccc acttggcagt acatcaagtg tatcatatgc caagtacgcc ccctattgac 2400
gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt acatgacctt atgggacttt 2460
cctacttggc agtacatcta cgtattagtc atcgctatta ccatgggtcg aggtgagccc 2520
cacgttctgc ttcactctcc ccatctcccc cccctcccca cccccaattt tgtatttatt 2580
tattttttaa ttattttgtg cagcgatggg ggcggggggg gggggggcgc gcgccaggcg 2640
gggcggggcg gggcgagggg cggggcgggg cgaggcggag aggtgcggcg gcagccaatc 2700
agagcggcgc gctccgaaag tttcctttta tggcgaggcg gcggcggcgg cggccctata 2760
aaaagcgaag cgcgcggcgg gcgggagtcg ctgcgttgcc ttcgccccgt gccccgctcc 2820
gcgccgcctc gcgccgcccg ccccggctct gactgaccgc gttactccca caggtgagcg 2880
ggcgggacgg cccttctcct ccgggctgta attagcgctt ggtttaatga cggctcgttt 2940
cttttctgtg gctgcgtgaa agccttaaag ggctccggga gggccctttg tgcggggggg 3000
agcggctcgg ggggtgcgtg cgtgtgtgtg tgcgtgggga gcgccgcgtg cggcccgcgc 3060
tgcccggcgg ctgtgagcgc tgcgggcgcg gcgcggggct ttgtgcgctc cgcgtgtgcg 3120
cgaggggagc gcggccgggg gcggtgcccc gcggtgcggg ggggctgcga ggggaacaaa 3180
ggctgcgtgc ggggtgtgtg cgtggggggg tgagcagggg gtgtgggcgc ggcggtcggg 3240
ctgtaacccc cccctgcacc cccctccccg agttgctgag cacggcccgg cttcgggtgc 3300
ggggctccgt gcggggcgtg gcgcggggct cgccgtgccg ggcggggggt ggcggcaggt 3360
gggggtgccg ggcggggcgg ggccgcctcg ggccggggag ggctcggggg aggggcgcgg 3420
cggccccgga gcgccggcgg ctgtcgaggc gcggcgagcc gcagccattg ccttttatgg 3480
taatcgtgcg agagggcgca gggacttcct ttgtcccaaa tctggcggag ccgaaatctg 3540
ggaggcgccg ccgcaccccc tctagcgggc gcgggcgaag cggtgcggcg ccggcaggaa 3600
ggaaatgggc ggggagggcc ttcgtgcgtc gccgcgccgc cgtccccttc tccatctcca 3660
gcctcggggc tgccgcaggg ggacggctgc cttcgggggg gacggggcag ggcggggttc 3720
ggcttctggc gtgtgaccgg cggctctaga agcgttgggg tgagtactcc ctctcaaaag 3780
cgggcatgac ttctgcgcta agattgtcag tttccaaaaa cgaggaggat ttgatattca 3840
cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat ctggtcagaa aagacaatct 3900
ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg gccatacact tgagtgacat 3960
tgacatccac tttgcctttc tctccacagg tgtccactcc cagggcggcc tccggagcga 4020
tcgccgatcc gcctagcatt caaccatggc tagattagat aaaagtaaag tgattaacag 4080
cgcattagag ctgcttaatg aggtcggaat cgaaggttta acaacccgta aactcgccca 4140
gaagctaggt gtagagcagc ctacattgta ttggcatgta aaaaataagc gggctttgct 4200
cgacgcctta gccattgaga tgttagatag gcaccatact cacttttgcc ctttagaagg 4260
ggaaagctgg caagattttt tacgtaataa cgctaaaagt tttagatgtg ctttactaag 4320
tcatcgcgat ggagcaaaag tacatttagg tacacggcct acagaaaaac agtatgaaac 4380
tctcgaaaat caattagcct ttttatgcca acaaggtttt tcactagaga atgcattata 4440
tgcactcagc gctgtggggc attttacttt aggttgcgta ttggaagatc aagagcatca 4500
agtcgctaaa gaagaaaggg aaacacctac tactgatagt atgccgccat tattacgaca 4560
agctatcgaa ttatttgatc accaaggtgc agagccagcc ttcttattcg gccttgaatt 4620
gatcatatgc ggattagaaa aacaacttaa atgtgaaagt gggtcgtaac cggttcgaga 4680
tccaggcgcg gatcaataaa agatcattat tttcaataga tctgtgtgtt ggttttttgt 4740
gtgccttggg ggagggggag gccagaatga ggcgcggcca agggggaggg ggaggccaga 4800
atgaccttgg gggaggggga ggccagaatg accttggggg agggggaggc cagaatgagg 4860
cgcgcccccg ggtaccgagc tcgaattgtt aattaaggcc atagcggccg ccctgaggcc 4920
gcgggcgatc gcctaggggt aaccgaagtt cctatacttt ctagagaata ggaacttcgg 4980
aataggaact tcttataatc tagaagatct ggatccacga ttcgagggcc cctgcaggtc 5040
aattctaccg ggtaggggag gcgcttttcc caaggcagtc tggagcatgc gctttagcag 5100
ccccgctggc acttggcgct acacaagtgg cctctggcct cgcacacatt ccacatccac 5160
cggtagcgcc aaccggctcc gttctttggt ggccccttcg cgccaccttc tactcctccc 5220
ctagtcagga agttcccccc cgccccgcag ctcgcgtcgt gcaggacgtg acaaatggaa 5280
gtagcacgtc tcactagtct cgtgcagatg gacagcaccg ctgagcaatg gaagcgggta 5340
ggcctttggg gcagcggcca atagcagctt tgctccttcg ctttctgggc tcagaggctg 5400
ggaaggggtg ggtccggggg cgggctcagg ggcgggctca ggggcggggc gggcgcgaag 5460
gtcctcccga ggcccggcat tctcgcacgc ttcaaaagcg cacgtctgcc gcgctgttct 5520
cctcttcctc atctccgggc ctttcgacga tccagccgcc accatgaaaa agcctgaact 5580
caccgcgacg tctgtcgaga agtttctgat cgaaaagttc gacagcgtct ccgacctgat 5640
gcagctctcg gagggcgaag aatctcgtgc tttcagcttc gatgtaggag ggcgtggata 5700
tgtcctgcgg gtaaatagct gcgccgatgg tttctacaaa gatcgttatg tttatcggca 5760
ctttgcatcg gccgcgctcc cgattccgga agtgcttgac attggggaat tcagcgagag 5820
cctgacctat tgcatctccc gccgtgcaca gggtgtcacg ttgcaagacc tgcctgaaac 5880
cgaactgccc gctgttctgc agccggtcgc ggaggccatg gatgccatcg ctgcggccga 5940
tcttagccag acgagcgggt tcggcccatt cggaccgcaa ggaatcggtc aatacactac 6000
atggcgtgat ttcatatgcg cgattgctga tccccatgtg tatcactggc aaactgtgat 6060
ggacgacacc gtcagtgcgt ccgtcgcgca ggctctcgat gagctgatgc tttgggccga 6120
ggactgcccc gaagtccggc acctcgtgca cgcggatttc ggctccaaca atgtcctgac 6180
ggacaatggc cgcataacag cggtcattga ctggagcgag gcgatgttcg gggattccca 6240
atacgaggtc gccaacatct tcttctggag gccgtggttg gcttgtatgg agcagcagac 6300
gcgctacttc gagcggaggc atccggagct tgcaggatcg ccgcggctcc gggcgtatat 6360
gctccgcatt ggtcttgacc aactctatca gagcttggtt gacggcaatt tcgatgatgc 6420
agcttgggcg cagggtcgat gcgacgcaat cgtccgatcc ggagccggga ctgtcgggcg 6480
tacacaaatc gcccgcagaa gcgcggccgt ctggaccgat ggctgtgtag aagtactcgc 6540
cgatagtgga aaccgacgcc ccagcactcg tccgagggca aaggaatagt cgatgcagaa 6600
attgatgatc tattaaacaa taaagatgtc cactaaaatg gaagtttttc ctgtcatact 6660
ttgttaagaa gggtgagaac agagtaccta cattttgaat ggaaggattg gagctacggg 6720
ggtgggggtg gggtgggatt agataaatgc ctgctcttta ctgaaggctc tttactattg 6780
ctttatgata atgtttcata gttggatatc ataatttaaa caagcaaaac caaattaagg 6840
gccagctcat tcctcccact catgatctat agatctatag atctctcgtg ggatcattgt 6900
ttttctcttg attcccactt tgtggttcta agtactgtgg tttccaaatg tgtcagtttc 6960
atagcctgaa gaacgagatc agcagcctct gttccacata cacttcattc tcagtattgt 7020
tttgccaagt tctaattcca tcagaagctg actctagatc ctgcaggaat tcatatgaag 7080
ttcctatact ttctagagaa taggaacttc ggaataggaa cttcaaaatg tcgcggcgcg 7140
ccggtaaccg aagttcctat actttctaga gaataggaac ttcggaatag gaacttcaag 7200
cttaagcgc 7209



<210> 15

<211> 7014 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GAGGS-Fluc



<400> 15 
tagcccgggc tagcttgcat gcctgcaggt tttcgacatt gattattgac tagttattaa 60

tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120

cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180

atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240

tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300

cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360

tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catgggtcga 420

ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac ccccaatttt 480

gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg ggggggcgcg 540

cgccaggcgg ggcggggcgg ggcgaggggc ggggcggggc gaggcggaga ggtgcggcgg 600

cagccaatca gagcggcgcg ctccgaaagt ttccttttat ggcgaggcgg cggcggcggc 660

ggccctataa aaagcgaagc gcgcggcggg cgggagtcgc tgcgttgcct tcgccccgtg 720

ccccgctccg cgccgcctcg cgccgcccgc cccggctctg actgaccgcg ttactcccac 780

aggtgagcgg gcgggacggc ccttctcctc cgggctgtaa ttagcgcttg gtttaatgac 840

ggctcgtttc ttttctgtgg ctgcgtgaaa gccttaaagg gctccgggag ggccctttgt 900

gcggggggga gcggctcggg gggtgcgtgc gtgtgtgtgt gcgtggggag cgccgcgtgc 960 

ggcccgcgct gcccggcggc tgtgagcgct gcgggcgcgg cgcggggctt tgtgcgctcc 1020 

gcgtgtgcgc gaggggagcg cggccggggg cggtgccccg cggtgcgggg gggctgcgag 1080 

gggaacaaag gctgcgtgcg gggtgtgtgc gtgggggggt gagcaggggg tgtgggcgcg 1140 

gcggtcgggc tgtaaccccc ccctgcaccc ccctccccga gttgctgagc acggcccggc 1200 

ttcgggtgcg gggctccgtg cggggcgtgg cgcggggctc gccgtgccgg gcggggggtg 1260 

gcggcaggtg ggggtgccgg gcggggcggg gccgcctcgg gccggggagg gctcggggga 1320 

ggggcgcggc ggccccggag cgccggcggc tgtcgaggcg cggcgagccg cagccattgc 1380 

cttttatggt aatcgtgcga gagggcgcag ggacttcctt tgtcccaaat ctggcggagc 1440

cgaaatctgg gaggcgccgc cgcaccccct ctagcgggcg cgggcgaagc ggtgcggcgc 1500 

cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg ccgcgccgcc gtccccttct 1560 

ccatctccag cctcggggct gccgcagggg gacggctgcc ttcggggggg acggggcagg 1620 

gcggggttcg gcttctggcg tgtgaccggc ggctctagaa gcgttggggt gagtactccc 1680 

tctcaaaagc gggcatgact tctgcgctaa gattgtcagt ttccaaaaac gaggaggatt 1740

tgatattcac ctggcccgcg gtgatgcctt tgagggtggc cgcgtccatc tggtcagaaa 1800 

agacaatctt tttgttgtca agcttgaggt gtggcaggct tgagatctgg ccatacactt 1860

gagtgacatt gacatccact ttgcctttct ctccacaggt gtccactccc agggcggcct 1920

ccggagcgat cgccggtccg cctaggcaat tgtttaaact cgaggatctg cgatctaagt 1980 

aagcttggca ttccggtact gttggtaaag ccaccatgga agacgccaaa aacataaaga 2040 

aaggcccggc gccattctat ccgctggaag atggaaccgc tggagagcaa ctgcataagg 2100 

ctatgaagag atacgccctg gttcctggaa caattgcttt tacagatgca catatcgagg 2160 

tggacatcac ttacgctgag tacttcgaaa tgtccgttcg gttggcagaa gctatgaaac 2220 

gatatgggct gaatacaaat cacagaatcg tcgtatgcag tgaaaactct cttcaattct 2280 

ttatgccggt gttgggcgcg ttatttatcg gagttgcagt tgcgcccgcg aacgacattt 2340 

ataatgaacg tgaattgctc aacagtatgg gcatttcgca gcctaccgtg gtgttcgttt 2400 

ccaaaaaggg gttgcaaaaa attttgaacg tgcaaaaaaa gctcccaatc atccaaaaaa 2460

ttattatcat ggattctaaa acggattacc agggatttca gtcgatgtac acgttcgtca 2520 

catctcatct acctcccggt tttaatgaat acgattttgt gccagagtcc ttcgataggg 2580

acaagacaat tgcactgatc atgaactcct ctggatctac tggtctgcct aaaggtgtcg 2640 

ctctgcctca tagaactgcc tgcgtgagat tctcgcatgc cagagatcct atttttggca 2700

atcaaatcat tccggatact gcgattttaa gtgttgttcc attccatcac ggttttggaa 2760

tgtttactac actcggatat ttgatatgtg gatttcgagt cgtcttaatg tatagatttg 2820 

aagaagagct gtttctgagg agccttcagg attacaagat tcaaagtgcg ctgctggtgc 2880 

caaccctatt ctccttcttc gccaaaagca ctctgattga caaatacgat ttatctaatt 2940 

tacacgaaat tgcttctggt ggcgctcccc tctctaagga agtcggggaa gcggttgcca 3000 

agaggttcca tctgccaggt atcaggcaag gatatgggct cactgagact acatcagcta 3060 

ttctgattac acccgagggg gatgataaac cgggcgcggt cggtaaagtt gttccatttt 3120

ttgaagcgaa ggttgtggat ctggataccg ggaaaacgct gggcgttaat caaagaggcg 3180

aactgtgtgt gagaggtcct atgattatgt ccggttatgt aaacaatccg gaagcgacca 3240 

acgccttgat tgacaaggat ggatggctac attctggaga catagcttac tgggacgaag 3300 

acgaacactt cttcatcgtt gaccgcctga agtctctgat taagtacaaa ggctatcagg 3360 

tggctcccgc tgaattggaa tccatcttgc tccaacaccc caacatcttc gacgcaggtg 3420 

tcgcaggtct tcccgacgat gacgccggtg aacttcccgc cgccgttgtt gttttggagc 3480 

acggaaagac gatgacggaa aaagagatcg tggattacgt cgccagtcaa gtaacaaccg 3540 

cgaaaaagtt gcgcggagga gttgtgtttg tggacgaagt accgaaaggt cttaccggaa 3600 

aactcgacgc aagaaaaatc agagagatcc tcataaaggc caagaagggc ggaaagatcg 3660

ccgtgtaatt ctagagtcgg ggcggccggc cgcttcgagc agacatgata agatacattg 3720 

atgagtttgg acaaaccaca actagaatgc agtgaaaaaa atgctttatt tgtgaaattt 3780

gtgatgctat tgctttattt gtaaccatta taagctgcaa taaacaagtt aacaacaaca 3840

attgcattca ttttatgttt caggttcagg gggaggtgtg ggaggttttt taaagcaagt 3900 

aaaacctcta caaatgtggt aaaatcgata aggatctgaa cgatggagcg gagaatgggc 3960 

ggaactgggc ggagttaggg gcgggatggg cggagttagg ggcgggacta tggttgctga 4020 

ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca 4080 

cctggttgct gactaattga gatgcatgct ttgcatactt ctgcctgctg gggagcctgg 4140

ggactttcca caccctaact gacacacatt ccacagcgga tccgtcgacc gatgcccttg 4200

agagccttca acccagtcag ctccttccgg tgggcgcggg gcatgactat cgtcgccgca 4260 

cttatgactg tcttctttat catgcaactc gtaggacagg tgccggcagc gctcttccgc 4320 

ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca 4380

ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg 4440 

agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca 4500 

taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa 4560 

cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc 4620
tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc 4680
gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct 4740 
gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg 4800 
tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag 4860
gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta 4920
cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg 4980
aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt 5040 
tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt 5100 
ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag 5160
attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat 5220 
ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc 5280
tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat 5340 
aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc 5400 
acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag 5460
aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag 5520 
agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt 5580 
ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg 5640 
agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt 5700 
tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc 5760
tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc 5820 
attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa 5880 
taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg 5940 
aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc 6000 
caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag 6060
gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt 6120 
cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt 6180 
tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc 6240
acctgacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt 6300 
gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct 6360
cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg 6420 
atttagtgct ttacggcacc tcgaccccaa aaaacttgat tagggtgatg gttcacgtag 6480
tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa 6540
tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcggtct attcttttga 6600 
tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa 6660 
atttaacgcg aattttaaca aaatattaac gcttacaatt tgccattcgc cattcaggct 6720
gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agcccaagct 6780 
accatgataa gtaagtaata ttaaggtacg ggaggtactt ggagcggccg caataaaata 6840 
tctttatttt cattacatct gtgtgttggt tttttgtgtg aatcgatagt actaacatac 6900 
gctctccatc aaaacaaaac gaaacaaaac aaactagcaa aataggctgt ccccagtgca 6960
agtgcaggtg ccagaacatt tctctatcga taggtaccga gctcttacgc gtgc 7014



<210> 16 

<211> 5430 

<212> DNA

<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: CAGGS-tetR



<400> 16 

ccgggctagc ttgcatgcct gcaggttttc gacattgatt attgactagt tattaatagt 60
aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta 120
cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga 180
cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg gtggactatt 240
tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta 300
ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg 360
actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg ggtcgaggtg 420
agccccacgt tctgcttcac tctccccatc tcccccccct ccccaccccc aattttgtat 480
ttatttattt tttaattatt ttgtgcagcg atgggggcgg gggggggggg ggcgcgcgcc 540
aggcggggcg gggcggggcg aggggcgggg cggggcgagg cggagaggtg cggcggcagc 600
caatcagagc ggcgcgctcc gaaagtttcc ttttatggcg aggcggcggc ggcggcggcc 660
ctataaaaag cgaagcgcgc ggcgggcggg agtcgctgcg ttgccttcgc cccgtgcccc 720
gctccgcgcc gcctcgcgcc gcccgccccg gctctgactg accgcgttac tcccacaggt 780
gagcgggcgg gacggccctt ctcctccggg ctgtaattag cgcttggttt aatgacggct 840
cgtttctttt ctgtggctgc gtgaaagcct taaagggctc cgggagggcc ctttgtgcgg 900
gggggagcgg ctcggggggt gcgtgcgtgt gtgtgtgcgt ggggagcgcc gcgtgcggcc 960
cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg gggctttgtg cgctccgcgt 1020
gtgcgcgagg ggagcgcggc cgggggcggt gccccgcggt gcgggggggc tgcgagggga 1080
acaaaggctg cgtgcggggt gtgtgcgtgg gggggtgagc agggggtgtg ggcgcggcgg 1140
tcgggctgta acccccccct gcacccccct ccccgagttg ctgagcacgg cccggcttcg 1200
ggtgcggggc tccgtgcggg gcgtggcgcg gggctcgccg tgccgggcgg ggggtggcgg 1260
caggtggggg tgccgggcgg ggcggggccg cctcgggccg gggagggctc gggggagggg 1320
cgcggcggcc ccggagcgcc ggcggctgtc gaggcgcggc gagccgcagc cattgccttt 1380
tatggtaatc gtgcgagagg gcgcagggac ttcctttgtc ccaaatctgg cggagccgaa 1440
atctgggagg cgccgccgca ccccctctag cgggcgcggg cgaagcggtg cggcgccggc 1500
aggaaggaaa tgggcgggga gggccttcgt gcgtcgccgc gccgccgtcc ccttctccat 1560
ctccagcctc ggggctgccg cagggggacg gctgccttcg ggggggacgg ggcagggcgg 1620
ggttcggctt ctggcgtgtg accggcggct ctagaagcgt tggggtgagt actccctctc 1680
aaaagcgggc atgacttctg cgctaagatt gtcagtttcc aaaaacgagg aggatttgat 1740
attcacctgg cccgcggtga tgcctttgag ggtggccgcg tccatctggt cagaaaagac 1800
aatctttttg ttgtcaagct tgaggtgtgg caggcttgag atctggccat acacttgagt 1860
gacattgaca tccactttgc ctttctctcc acaggtgtcc actcccaggg cggcctccgg 1920
agcgatcgcc gatccgccta gcattcaacc atggctagat tagataaaag taaagtgatt 1980
aacagcgcat tagagctgct taatgaggtc ggaatcgaag gtttaacaac ccgtaaactc 2040
gcccagaagc taggtgtaga gcagcctaca ttgtattggc atgtaaaaaa taagcgggct 2100
ttgctcgacg ccttagccat tgagatgtta gataggcacc atactcactt ttgcccttta 2160
gaaggggaaa gctggcaaga ttttttacgt aataacgcta aaagttttag atgtgcttta 2220
ctaagtcatc gcgatggagc aaaagtacat ttaggtacac ggcctacaga aaaacagtat 2280
gaaactctcg aaaatcaatt agccttttta tgccaacaag gtttttcact agagaatgca 2340
ttatatgcac tcagcgctgt ggggcatttt actttaggtt gcgtattgga agatcaagag 2400
catcaagtcg ctaaagaaga aagggaaaca cctactactg atagtatgcc gccattatta 2460
cgacaagcta tcgaattatt tgatcaccaa ggtgcagagc cagccttctt attcggcctt 2520
gaattgatca tatgcggatt agaaaaacaa cttaaatgtg aaagtgggtc gtaaccggtt 2580
cgagatccag gcgcggatca ataaaagatc attattttca atagatctgt gtgttggttt 2640
tttgtgtgcc ttgggggagg gggaggccag aatgaggcgc ggccaagggg gagggggagg 2700
ccagaatgac cttgggggag ggggaggcca gaatgacctt gggggagggg gaggccagaa 2760
tgaggcgcgc ccccgggtac cgagctcgaa ttcactggcc gtcgttttac aacgtcgtga 2820
ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag 2880
ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa 2940
tggcgaatgg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg 3000
catatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca 3060
cccgccaaca cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag 3120
acaagctgtg accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa 3180
acgcgcgaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat 3240
aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg 3300
tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat 3360
gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat 3420
tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt 3480
aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag 3540
cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa 3600
agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg 3660
ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct 3720
tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac 3780
tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca 3840
caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat 3900
accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact 3960
attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc 4020
ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga 4080
taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg 4140
taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg 4200
aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca 4260
agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta 4320
ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca 4380
ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg 4440
cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga 4500
tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa 4560
tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc 4620
tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg 4680
tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac 4740
ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct 4800
acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc 4860
ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg 4920
gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg 4980
ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct 5040
ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga 5100
taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg 5160
cagcgagtca gtgagcgagg aagcggaaga gcgcccaata cgcaaaccgc ctctccccgc 5220
gcgttggccg attcattaat gcagctggca cgacaggttt cccgactgga aagcgggcag 5280
tgagcgcaac gcaattaatg tgagttagct cactcattag gcaccccagg ctttacactt 5340
tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga taacaatttc acacaggaaa 5400
cagctatgac catgattacg ccaagctagc 5430



<210> 17 
<211> 7332 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CMV-LacZ



<400> 17 

aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 420
cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 480
ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 540
aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 600
gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 660
attaatacga ctcactatag ggagacccaa gctgactcta gacttaatta agcgttgggg 720
tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa 780
cgaggaggat ttgatattca cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat 840
ctggtcagaa aagacaatct ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg 900
gccatacact tgagtgacat tgacatccac tttgcctttc tctccacagg tgtccactcc 960
cagggcggcc gcaattcccg gggatcgaaa gagcctgcta aagcaaaaaa gaagtcacca 1020
tgtcgtttac tttgaccaac aagaacgtga ttttcgttgc cggtctggga ggcattggtc 1080
tggacaccag caaggagctg ctcaagcgcg atcccgtcgt tttacaacgt cgtgactggg 1140
aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc 1200
gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg 1260
aatggcgctt tgcctggttt ccggcaccag aagcggtgcc ggaaagctgg ctggagtgcg 1320
atcttcctga ggccgatact gtcgtcgtcc cctcaaactg gcagatgcac ggttacgatg 1380
cgcccatcta caccaacgta acctatccca ttacggtcaa tccgccgttt gttcccacgg 1440
agaatccgac gggttgttac tcgctcacat ttaatgttga tgaaagctgg ctacaggaag 1500
gccagacgcg aattattttt gatggcgtta actcggcgtt tcatctgtgg tgcaacgggc 1560
gctgggtcgg ttacggccag gacagtcgtt tgccgtctga atttgacctg agcgcatttt 1620
tacgcgccgg agaaaaccgc ctcgcggtga tggtgctgcg ttggagtgac ggcagttatc 1680
tggaagatca ggatatgtgg cggatgagcg gcattttccg tgacgtctcg ttgctgcata 1740
aaccgactac acaaatcagc gatttccatg ttgccactcg ctttaatgat gatttcagcc 1800
gcgctgtact ggaggctgaa gttcagatgt gcggcgagtt gcgtgactac ctacgggtaa 1860
cagtttcttt atggcagggt gaaacgcagg tcgccagcgg caccgcgcct ttcggcggtg 1920
aaattatcga tgagcgtggt ggttatgccg atcgcgtcac actacgtctg aacgtcgaaa 1980
acccgaaact gtggagcgcc gaaatcccga atctctatcg tgcggtggtt gaactgcaca 2040
ccgccgacgg cacgctgatt gaagcagaag cctgcgatgt cggtttccgc gaggtgcgga 2100
ttgaaaatgg tctgctgctg ctgaacggca agccgttgct gattcgaggc gttaaccgtc 2160
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acgagcatca tcctctgcat ggtcaggtca tggatgagca gacgatggtg caggatatcc 2220 
tgctgatgaa gcagaacaac tttaacgccg tgcgctgttc gcattatccg aaccatccgc 2280
tgtggtacac gctgtgcgac cgctacggcc tgtatgtggt ggatgaagcc aatattgaaa 2340 
cccacggcat ggtgccaatg aatcgtctga ccgatgatcc gcgctggcta ccggcgatga 2400 
gcgaacgcgt aacgcgaatg gtgcagcgcg atcgtaatca cccgagtgtg atcatctggt 2460 
cgctggggaa tgaatcaggc cacggcgcta atcacgacgc gctgtatcgc tggatcaaat 2520 
ctgtcgatcc ttcccgcccg gtgcagtatg aaggcggcgg agccgacacc acggccaccg 2580
atattatttg cccgatgtac gcgcgcgtgg atgaagacca gcccttcccg gctgtgccga 2640 
aatggtccat caaaaaatgg ctttcgctac ctggagagac gcgcccgctg atcctttgcg 2700
aatacgccca cgcgatgggt aacagtcttg gcggtttcgc taaatactgg caggcgtttc 2760 
gtcagtatcc ccgtttacag ggcggcttcg tctgggactg ggtggatcag tcgctgatta 2820 
aatatgatga aaacggcaac ccgtggtcgg cttacggcgg tgattttggc gatacgccga 2880 
acgatcgcca gttctgtatg aacggtctgg tctttgccga ccgcacgccg catccagcgc 2940 
tgacggaagc aaaacaccag cagcagtttt tccagttccg tttatccggg caaaccatcg 3000 
aagtgaccag cgaatacctg ttccgtcata gcgataacga gctcctgcac tggatggtgg 3060 
cgctggatgg taagccgctg gcaagcggtg aagtgcctct ggatgtcgct ccacaaggta 3120 
aacagttgat tgaactgcct gaactaccgc agccggagag cgccgggcaa ctctggctca 3180 
cagtacgcgt agtgcaaccg aacgcgaccg catggtcaga agccgggcac atcagcgcct 3240 
ggcagcagtg gcgtctggcg gaaaacctca gtgtgacgct ccccgccgcg tcccacgcca 3300 
tcccgcatct gaccaccagc gaaatggatt tttgcatcga gctgggtaat aagcgttggc 3360 
aatttaaccg ccagtcaggc tttctttcac agatgtggat tggcgataaa aaacaactgc 3420
tgacgccgct gcgcgatcag ttcacccgtg caccgctgga taacgacatt ggcgtaagtg 3480 
aagcgacccg cattgaccct aacgcctggg tcgaacgctg gaaggcggcg ggccattacc 3540 
aggccgaagc agcgttgttg cagtgcacgg cagatacact tgctgatgcg gtgctgatta 3600 
cgaccgctca cgcgtggcag catcagggga aaaccttatt tatcagccgg aaaacctacc 3660 
ggattgatgg tagtggtcaa atggcgatta ccgttgatgt tgaagtggcg agcgatacac 3720 
cgcatccggc gcggattggc ctgaactgcc agctggcgca ggtagcagag cgggtaaact 3780 
ggctcggatt agggccgcaa gaaaactatc ccgaccgcct tactgccgcc tgttttgacc 3840 
gctgggatct gccattgtca gacatgtata ccccgtacgt cttcccgagc gaaaacggtc 3900 
tgcgctgcgg gacgcgcgaa ttgaattatg gcccacacca gtggcgcggc gacttccagt 3960 
tcaacatcag ccgctacagt caacagcaac tgatggaaac cagccatcgc catctgctgc 4020
acgcggaaga aggcacatgg ctgaatatcg acggtttcca tatggggatt ggtggcgacg 4080
actcctggag cccgtcagta tcggcggaat tacagctgag cgccggtcgc taccattacc 4140 
agttggtctg gtgtcaaaaa taataataac cgggcaggcc atgtctgccc gtatttcgcg 4200 
taaggaaatc cattatgtac tatttaaaaa acacaaactt ttggatgttc ggtttattct 4260 
ttttctttta cttttttatc atgggagcct acttcccgtt tttcccgatt tggctacatg 4320
acatcaacca tatcagcaaa agtgatacgg gtattatttt tgccgctatt tctctgttct 4380 
cgctattatt ccaaccgctg tttggtctgc tttctgacaa actcggcctc gactctaggc 4440 
ggccgcgtcg acctcgagat ccaggcgcgg atcaataaaa gatcattatt ttcaatagat 4500 
ctgtgtgttg gttttttgtg tgccttgggg gagggggagg ccagaatgag gcgcggccaa 4560 
gggggagggg gaggccagaa tgaccttggg ggagggggag gccagaatga ccttggggga 4620 
gggggaggcc agaatgaggc gcgcccccgg gtaccgagct cgaattcact ggccgtcgtt 4680
ttacaacgtc gtgactggga aaaccctggc gttacccaac ttaatcgcct tgcagcacat 4740 
ccccctttcg ccagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcccaacag 4800 
ttgcgcagcc tgaatggcga atggcgcctg atgcggtatt ttctccttac gcatctgtgc 4860
ggtatttcac accgcatatg gtgcactctc agtacaatct gctctgatgc cgcatagtta 4920 
agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg 4980 
gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca 5040 
ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt tttataggtt 5100 
aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc 5160 
ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa 5220
taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc 5280 
cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa 5340
acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa 5400 
ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg 5460
atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa 5520 
gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc 5580 
acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc 5640 
atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta 5700 
accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag 5760
ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca 5820 
acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca acaattaata 5880 
gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc 5940 
tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca 6000
ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca 6060
actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg 6120
taactgtcag accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa 6180
tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt 6240
gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat 6300
cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg 6360
gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga 6420
gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac 6480
tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt 6540
ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag 6600
cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc 6660
gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag 6720
gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca 6780
gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt 6840
cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc 6900
tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc 6960
cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc 7020
cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa 7080
ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac 7140
tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt agctcactca ttaggcaccc 7200
caggctttac actttatgct tccggctcgt atgttgtgtg gaattgtgag cggataacaa 7260
tttcacacag gaaacagcta tgaccatgat tacgccaagc tagcccgggc tagcttgcat 7320
gcctgcaggt tt 7332



<210> 18 
<211> 5878 

<212> DNA

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: CAGGS-cre 



<400> 18 

ttcgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 60
gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 120
ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 180
ggactttcca ttgacgtcaa tgggtggact atttacggta aactgcccac ttggcagtac 240
atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg 300
cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 360
tattagtcat cgctattacc atgggtcgag gtgagcccca cgttctgctt cactctcccc 420
atctcccccc cctccccacc cccaattttg tatttattta ttttttaatt attttgtgca 480
gcgatggggg cggggggggg gggggcgcgc gccaggcggg gcggggcggg gcgaggggcg 540
gggcggggcg aggcggagag gtgcggcggc agccaatcag agcggcgcgc tccgaaagtt 600
tccttttatg gcgaggcggc ggcggcggcg gccctataaa aagcgaagcg cgcggcgggc 660
gggagtcgct gcgttgcctt cgccccgtgc cccgctccgc gccgcctcgc gccgcccgcc 720
ccggctctga ctgaccgcgt tactcccaca ggtgagcggg cgggacggcc cttctcctcc 780
gggctgtaat tagcgcttgg tttaatgacg gctcgtttct tttctgtggc tgcgtgaaag 840
ccttaaaggg ctccgggagg gccctttgtg cgggggggag cggctcgggg ggtgcgtgcg 900
tgtgtgtgtg cgtggggagc gccgcgtgcg gcccgcgctg cccggcggct gtgagcgctg 960
cgggcgcggc gcggggcttt gtgcgctccg cgtgtgcgcg aggggagcgc ggccgggggc 1020
ggtgccccgc ggtgcggggg ggctgcgagg ggaacaaagg ctgcgtgcgg ggtgtgtgcg 1080
tgggggggtg agcagggggt gtgggcgcgg cggtcgggct gtaacccccc cctgcacccc 1140
cctccccgag ttgctgagca cggcccggct tcgggtgcgg ggctccgtgc ggggcgtggc 1200
gcggggctcg ccgtgccggg cggggggtgg cggcaggtgg gggtgccggg cggggcgggg 1260
ccgcctcggg ccggggaggg ctcgggggag gggcgcggcg gccccggagc gccggcggct 1320
gtcgaggcgc ggcgagccgc agccattgcc ttttatggta atcgtgcgag agggcgcagg 1380
gacttccttt gtcccaaatc tggcggagcc gaaatctggg aggcgccgcc gcaccccctc 1440
tagcgggcgc gggcgaagcg gtgcggcgcc ggcaggaagg aaatgggcgg ggagggcctt 1500
cgtgcgtcgc cgcgccgccg tccccttctc catctccagc ctcggggctg ccgcaggggg 1560
acggctgcct tcggggggga cggggcaggg cggggttcgg cttctggcgt gtgaccggcg 1620
gctctagtaa gcgttggggt gagtactccc tctcaaaagc gggcatgact tctgcgctaa 1680
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gattgtcagt 
tgagggtggc 
gtggcaggct 
ctccacaggt 
ccaatttact 
aggttcgcaa 
ggaaaatgct 
aatggtttcc 
gtctggcagt 
ccgggctgcc 
aagaaaacgt 
tcgaccaggt 
catttctggg 
ttaaagatat 
cgctggttag 
agcgatggat 
tcagaaaaaa 
aagggatttt 
gatacctggc 
ctggagtttc 
tgaactatat 
gcgattagcc 
tcgagatcca
ttttgtgtgc 
gccagaatga 
atgaggcgcg 
actgggaaaa 
gctggcgtaa 
atggcgaatg 
gcatatggtg 
acccgccaac 
gacaagctgt 
aacgcgcgag 
taatggtttc 
gtttattttt 
tgcttcaata 
ttcccttttt 
taaaagatgc 
gcggtaagat 
aagttctgct 
gccgcataca 
ttacggatgg 
ctgcggccaa 
acaacatggg 
taccaaacga 
tattaactgg 
cggataaagt 
ataaatctgg 
gtaagccctc 
gaaatagaca 
aagtttactc 
aggtgaagat 
actgagcgtc 
gcgtaatctg 
atcaagagct 
atactgtcct 
ctacatacct 
gtcttaccgg 
cggggggttc 
tacagcgtga 
cggtaagcgg 
ggtatcttta 
gctcgtcagg 



ttccaaaaac 
cgcgtccatc 
tgagatctgg 
gtccactccc 
gaccgtacac 
gaacctgatg 
tctgtccgtt 
cgcagaacct 
aaaaactatc
acgaccaagt 
tgatgccggt 
tcgttcactc 
gattgcttat 
ctcacgtact 
caccgcaggt 
ttccgtctct 
tggtgttgcc 
tgaagcaact 
ctggtctgga 
aataccggag 
ccgtaacctg 
attaacgcgt 
ggcgcggatc 
cttgggggag 
ccttggggga 
cccccgggta 
ccctggcgtt 
tagcgaagag 
gcgcctgatg 
cactctcagt 
acccgctgac 
gaccgtctcc 
acgaaagggc 
ttagacgtca 
ctaaatacat 
atattgaaaa 
tgcggcattt 
tgaagatcag 
ccttgagagt 
atgtggcgcg 
ctattctcag 
catgacagta 
cttacttctg 
ggatcatgta 
cgagcgtgac 
cgaactactt 
tgcaggacca 
agccggtgag 
ccgtatcgta 
gatcgctgag 
atatatactt 
cctttttgat 
agaccccgta 
ctgcttgcaa 
accaactctt 
tctagtgtag 
cgctctgcta 
gttggactca 
gtgcacacag 
gctatgagaa 
cagggtcgga 
tagtcctgtc 
ggggcggagc 



gaggaggatt 
tggtcagaaa 
ccatacactt 
agggcggcct 
caaaatttgc 
gacatgttca 
tgccggtcgt 
gaagatgttc 
cagcaacatt 
gacagcaatg 
gaacgtgcaa 
atggaaaata 
aacaccctgt 
gacggtggga 
gtagagaagg 
ggtgtagctg 
gcgccatctg 
catcgattga 
cacagtgccc 
atcatgcaag 
gatagtgaaa 
aaatgattgc 
aataaaagat 
ggggaggcca 
gggggaggcc 
ccgagctcga 
acccaactta 
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tggccttttg 
ataaccgtat 
gcagcgagtc 
cgcgttggcc 
gtgagcgcaa 
ttatgcttcc 
acagctatga 
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gattcattaa 
cgcaattaat 
ggctcgtatg 
ccatgattac 



gctcacatgt 
gagtgagctg 
gaagcggaag 
tgcagctggc 
gtgagttagc 
ttgtgtggaa 
gccaagctag 
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tcactcatta 
ttgtgagcgg 
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